Abstract. We consider a $\Lambda$-coalescent and study the asymptotic behavior of the total
length $L^{(n)}_{ext}$ of the external branches of the associated $n$-coalescent. For the Kingman coalescent,
i.e. $\Lambda = \delta_0$, the result is well known and is useful, together with the total length $L^{(n)}$, for
Fu and Li's test of neutrality of mutations. For a large family of measures $\Lambda$, including
Beta$(2-\alpha, \alpha)$ with $0 < \alpha < 1$, Möhle has proved asymptotics of $L^{(n)}_{ext}$. Here we consider
the case when the measure $\Lambda$ is Beta$(2-\alpha, \alpha)$ with $1 < \alpha < 2$. We prove that $n^{\alpha-2}L^{(n)}_{ext}$
converges in $L^2$ to $\alpha(\alpha-1)\Gamma(\alpha)$. As a consequence, we get that $L^{(n)}_{ext}/L^{(n)}$ converges in
probability to $2-\alpha$. To prove the asymptotics of $L^{(n)}_{ext}$, we use a recursive construction of
the $n$-coalescent by adding individuals one by one. Asymptotics of the distribution of $d$
normalized external branch lengths and a related moment result are also given.



1. Introduction 

1.1. Motivation. In a Wright-Fisher haploid population model with size $N$, we sample $n$
individuals at present from the total population, and look backward to see the ancestral tree
until we get to the most recent common ancestor (MRCA). If time is suitably rescaled and the
size $N$ of the population becomes large, then the genealogy of the sample of size $n$ converges
weakly to the Kingman $n$-coalescent (see [25], [26]). During the evolution of the population,
mutations may occur. We consider the infinite sites model introduced by Kimura [24]. In this
model, each mutation is produced at a new site which has never been seen before and will never be
seen again in the future. The neutrality of mutations means that all mutants are equally favored
by the environment. Under the infinite sites model, to detect or reject neutrality when
the genealogy is given by the Kingman coalescent, Fu and Li [17] have proposed a statistical
test based on the total numbers of mutations on the external branches and on the internal branches.
Mutations occurring on external branches affect only single individuals, so in practice they
can be picked out easily according to the model setting. In this test, the asymptotics of the
ratio $L^{(n)}_{ext}/L^{(n)}$ between the total length of the external branches $L^{(n)}_{ext}$ and the total length
$L^{(n)}$ measures in some sense the weight of the mutations occurring on external branches among
all mutations.

For many populations, the Kingman $n$-coalescent describes the genealogy quite well. But
for some others, when the descendants of one individual can occupy a large fraction of the next
generation with non-negligible probability, it is no longer relevant. This is for example the case
for some marine species (see [1], [7], [E], [H], [21]). In this case, if time is suitably rescaled and the
size of the population becomes large, the ancestral tree converges weakly to the $\Lambda$ $n$-coalescent,
where multiple collisions may appear. This class of coalescents was first introduced by
Pitman [29] and Sagitov [30]. Among $\Lambda$ $n$-coalescents, a special and important subclass is that of the
Beta $n$-coalescents, characterized by $\Lambda$ being a Beta distribution. The most popular ones are
those with parameters $2-\alpha$ and $\alpha$, for $\alpha \in (0, 2)$.

Beta$(2-\alpha, \alpha)$ $n$-coalescents arise not only in the context of biology. They are also
connected with supercritical Galton-Watson processes (see [31]), with continuous-state branching
processes (see [1]), and with continuous random trees (see [2]). If $\alpha = 1$, we recover the
Bolthausen-Sznitman $n$-coalescent, which appears in the field of spin glasses (see [6], [8]) and
is also connected to random recursive trees (see [20]). The Kingman coalescent is also
obtained from the Beta$(2-\alpha, \alpha)$ coalescent by letting $\alpha$ tend to 2.

For Beta$(2-\alpha, \alpha)$ $n$-coalescents with $0 < \alpha < 2$, various quantities have been studied under
some conditions on $\alpha$, especially the law of the length of an external branch taken at random,
the total length of external branches $L^{(n)}_{ext}$ and the total length $L^{(n)}$. A short survey is given
in the next section. In this paper, we consider the Beta$(2-\alpha, \alpha)$ $n$-coalescent with $1 < \alpha < 2$.
Asymptotics of the normalized length of an external branch taken at random have recently
been proved by Dhersin et al. in [11]. Here, we extend this result and give the asymptotics of
the joint distribution of the lengths of a finite family of external branches taken at random,
and use this result to get the asymptotics of $L^{(n)}_{ext}$. As a consequence, we get that the sequence
$L^{(n)}_{ext}/L^{(n)}$ converges in probability to $2-\alpha$.

1.2. Introduction and main results. Let $\mathcal{E}$ be the set of partitions of $\mathbb{N} := \{1, 2, 3, \dots\}$ and,
for $1 \le j \le n \le \infty$, $\mathcal{E}_{j,n}$ the set of partitions of $\mathbb{N}_{j,n} := \{j, j+1, \dots, n\}$ if $n \in \mathbb{N}$ and $\mathbb{N}_{j,\infty} :=
\{j, j+1, \dots\}$. We denote by $\rho^{(j,n)}$ the natural restriction on $\mathcal{E}_{j,n}$: if $1 \le l \le j \le n \le m \le \infty$
and $\pi = \{A_i\}_{i \in I}$ is a partition of $\mathbb{N}_{l,m}$, then $\rho^{(j,n)}\pi$ is the partition of $\mathbb{N}_{j,n}$ defined by $\rho^{(j,n)}\pi =
\{A_i \cap \mathbb{N}_{j,n}\}_{i \in I}$. For a finite measure $\Lambda$ on $[0,1]$, we denote by $\Pi = (\Pi_t)_{t \ge 0}$ the $\Lambda$-coalescent
process introduced independently by Pitman [29] and Sagitov [30]. The process $(\Pi_t)_{t \ge 0}$ is a
càdlàg continuous time Markovian process taking values in $\mathcal{E}$ with $\Pi_0 = \{\{1\}, \{2\}, \{3\}, \dots\}$,
which is characterized by the càdlàg $\Lambda$ $n$-coalescent processes $(\Pi^{(n)}_t)_{t \ge 0} := (\rho^{(1,n)}\Pi_t)_{t \ge 0}$, $n \in \mathbb{N}$.
For $n \le m \le \infty$, we have $(\Pi^{(n)}_t)_{t \ge 0} = (\rho^{(1,n)}\Pi^{(m)}_t)_{t \ge 0}$ (where $\Pi^{(\infty)} = \Pi$). This property
is called the consistency property.

We introduce the measure $\nu(dx) = x^{-2}\Lambda(dx)$. For $2 \le a \le b$, we set

$$\lambda_{b,a} = \int_0^1 x^{a-2}(1-x)^{b-a}\,\Lambda(dx) = \int_0^1 x^{a}(1-x)^{b-a}\,\nu(dx).$$

For fixed $n \in \mathbb{N}$, we can think of $(\Pi^{(n)}_t)_{t \ge 0}$ as a random tree with $n$ leaves having labels
from 1 to $n$. This is a Markovian process with values in $\mathcal{E}_{1,n}$, and its transition rates are given
by: for $\xi, \eta \in \mathcal{E}_{1,n}$, $q_{\xi,\eta} = \lambda_{b,a}$ if $\eta$ is obtained by merging $a$ of the $b = |\xi|$ blocks of $\xi$ and leaving
the $b - a$ others unchanged, and $q_{\xi,\eta} = 0$ otherwise. We say that $a$ individuals (or blocks)
of $\xi$ have been coalesced into one single individual of $\eta$. Remark that the process $\Pi^{(n)}$ is an
exchangeable process, which means that, for any permutation $\tau$ of $\mathbb{N}_{1,n}$, $\tau \circ \Pi^{(n)} \overset{(d)}{=} \Pi^{(n)}$.
The consistency property is equivalent to the equation:

$$\lambda_{b,a} = \lambda_{b+1,a+1} + \lambda_{b+1,a}. \tag{1}$$

This is Pitman's structure Theorem, see Lemma 18 in [29]. This relationship comes from the
fact that $a$ given merging blocks among $b$ can coalesce in two ways when an extra
block is revealed: either the coalescence event involves the extra block (and then $a + 1$ blocks merge),
or it does not.
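As a numerical illustration of equation (1), the following sketch evaluates $\lambda_{b,a}$ in the special case $\Lambda$ = Beta$(2-\alpha, \alpha)$, where the integral reduces to a ratio of Beta functions, $\lambda_{b,a} = B(a-\alpha, b-a+\alpha)/B(2-\alpha, \alpha)$. The function names (`log_beta`, `lam`, `consistency_gap`) are illustrative choices and do not come from the paper.

```python
from math import lgamma, exp


def log_beta(x, y):
    """Logarithm of the Euler Beta function B(x, y), for x, y > 0."""
    return lgamma(x) + lgamma(y) - lgamma(x + y)


def lam(b, a, alpha):
    """Rate lambda_{b,a} when Lambda = Beta(2 - alpha, alpha), 0 < alpha < 2.

    lambda_{b,a} = int_0^1 x^(a-2) (1-x)^(b-a) Lambda(dx)
                 = B(a - alpha, b - a + alpha) / B(2 - alpha, alpha).
    """
    if not 2 <= a <= b:
        raise ValueError("need 2 <= a <= b")
    return exp(log_beta(a - alpha, b - a + alpha) - log_beta(2 - alpha, alpha))


def consistency_gap(b, a, alpha):
    """lambda_{b,a} - (lambda_{b+1,a+1} + lambda_{b+1,a}); zero up to rounding."""
    return lam(b, a, alpha) - (lam(b + 1, a + 1, alpha) + lam(b + 1, a, alpha))


if __name__ == "__main__":
    alpha = 1.5
    for b in range(2, 7):
        for a in range(2, b + 1):
            print(b, a, lam(b, a, alpha), consistency_gap(b, a, alpha))
```

Working with `lgamma` rather than `gamma` keeps the computation stable for large $b$; the gap printed in the last column should vanish up to floating-point rounding.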
Remark that $\Pi^{(n)}$ finally reaches one block. This final individual is called the most recent
common ancestor (MRCA). We denote by $\tau^{(n)}$ the number of collisions it takes for the $n$
individuals to be coalesced to the MRCA.

We denote by $R^{(n)} = (R^{(n)}_t)_{t \ge 0}$ the block counting process of $(\Pi^{(n)}_t)_{t \ge 0}$: $R^{(n)}_t = |\Pi^{(n)}_t|$,
which equals the number of blocks/individuals at time $t$. Then $R^{(n)}$ is also a continuous time
Markovian process taking values in $\mathbb{N}_{1,n}$, decreasing from $n$ to 1. At state $b$, for $a = 2, \dots, b$,
each of the $\binom{b}{a}$ groups of $a$ individuals coalesces independently at rate $\lambda_{b,a}$. Hence, the time
the process $(R^{(n)}_t)_{t \ge 0}$ stays at state $b$ is exponential with parameter:

$$g_b = \sum_{a=2}^{b} \binom{b}{a} \lambda_{b,a} = \int_0^1 \left(1 - (1-x)^b - bx(1-x)^{b-1}\right) \nu(dx).$$

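We denote by $Y^{(n)} = (Y^{(n)}_k)_{k \ge 0}$ the discrete time Markov chain associated with $R^{(n)}$.
This is a decreasing process from $Y^{(n)}_0 = n$ which reaches 1 at time $\tau^{(n)}$. The transition
probabilities of the Markov chain $Y^{(n)}$ are given by: for $b \ge 2$ and $1 \le l \le b - 1$,

$$\mathbb{P}\left(Y^{(n)}_k = b - l \,\middle|\, Y^{(n)}_{k-1} = b\right) = \frac{\binom{b}{l+1}\lambda_{b,l+1}}{g_b},$$

and 1 is an absorbing state.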
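The holding rate $g_b$ and the jump law of the chain $Y^{(n)}$ can be tabulated directly. The sketch below does so for $\Lambda$ = Beta$(2-\alpha, \alpha)$; the helper names are hypothetical, and `asymptotic_rate` encodes the equivalent $g_n \sim n^{\alpha}/(\alpha\Gamma(\alpha))$ that is recalled later in the paper, used here only as a rough sanity check.

```python
from math import lgamma, exp, gamma


def log_beta(x, y):
    """Logarithm of the Euler Beta function B(x, y), for x, y > 0."""
    return lgamma(x) + lgamma(y) - lgamma(x + y)


def log_rate(b, a, alpha):
    """log of C(b, a) * lambda_{b,a} for Lambda = Beta(2 - alpha, alpha)."""
    log_binom = lgamma(b + 1) - lgamma(a + 1) - lgamma(b - a + 1)
    return log_binom + log_beta(a - alpha, b - a + alpha) - log_beta(2 - alpha, alpha)


def total_rate(b, alpha):
    """g_b = sum_{a=2}^{b} C(b, a) * lambda_{b,a}."""
    return sum(exp(log_rate(b, a, alpha)) for a in range(2, b + 1))


def jump_law(b, alpha):
    """Map l -> P(Y_k = b - l | Y_{k-1} = b), for l = 1, ..., b - 1."""
    g_b = total_rate(b, alpha)
    return {l: exp(log_rate(b, l + 1, alpha)) / g_b for l in range(1, b)}


def asymptotic_rate(b, alpha):
    """Leading-order equivalent b^alpha / (alpha * Gamma(alpha)) of g_b."""
    return b ** alpha / (alpha * gamma(alpha))


if __name__ == "__main__":
    alpha = 1.5
    print(jump_law(5, alpha))
    for b in (10, 100, 1000):
        print(b, total_rate(b, alpha) / asymptotic_rate(b, alpha))
```

The ratio printed in the loop should approach 1 as $b$ grows, in line with the stated equivalent.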

We introduce the discrete time process $X^{(n)}_k := Y^{(n)}_{k-1} - Y^{(n)}_k$, $k \ge 1$, with $X^{(n)}_0 = 0$. This
process counts the number of blocks we lose at the $k$-th jump. For $i \in \{1, \dots, n\}$ define

$$T^{(n)}_i := \inf\{t \mid \{i\} \notin \Pi^{(n)}_t\}$$

as the length of the $i$-th external branch and $T^{(n)}$ the length of a randomly chosen external
branch. By exchangeability, $T^{(n)}_i \overset{(d)}{=} T^{(n)}$. We denote by $L^{(n)}_{ext} := \sum_{i=1}^{n} T^{(n)}_i$ the total length
of the external branches of $\Pi^{(n)}$, and by $L^{(n)}$ the total length of $\Pi^{(n)}$. Recall that the block
counting process $R^{(n)}$ stays at state $b$ an exponential time with parameter $g_b$. Hence we have

$$L^{(n)} = \sum_{k=0}^{\tau^{(n)}-1} Y^{(n)}_k \frac{e_k}{g_{Y^{(n)}_k}},$$

where $e_k$, $k \ge 0$, are independent exponential random variables with parameter 1.
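The definitions of $L^{(n)}_{ext}$ and $L^{(n)}$ translate into a short Monte Carlo routine. The following is a minimal sketch, assuming $\Lambda$ = Beta$(2-\alpha, \alpha)$ and using exchangeability to pick the merging blocks uniformly at random; it is not the recursive construction used in the paper, and all names are illustrative.

```python
import random
from math import lgamma, exp


def log_beta(x, y):
    """Logarithm of the Euler Beta function B(x, y), for x, y > 0."""
    return lgamma(x) + lgamma(y) - lgamma(x + y)


def merger_weights(b, alpha):
    """Weights C(b, a) * lambda_{b,a}, a = 2..b, for Lambda = Beta(2 - alpha, alpha)."""
    norm = log_beta(2 - alpha, alpha)
    weights = []
    for a in range(2, b + 1):
        log_binom = lgamma(b + 1) - lgamma(a + 1) - lgamma(b - a + 1)
        weights.append(exp(log_binom + log_beta(a - alpha, b - a + alpha) - norm))
    return weights


def simulate_lengths(n, alpha, rng):
    """Return (L_ext, L) for one simulated Beta(2 - alpha, alpha) n-coalescent."""
    is_external = [True] * n  # one flag per current block: is it still a leaf?
    total = 0.0
    external = 0.0
    while len(is_external) > 1:
        b = len(is_external)
        weights = merger_weights(b, alpha)
        hold = rng.expovariate(sum(weights))  # holding time, Exp(g_b)
        total += b * hold
        external += sum(is_external) * hold
        a = rng.choices(range(2, b + 1), weights=weights)[0]  # merger size
        merged = set(rng.sample(range(b), a))  # uniformly chosen blocks
        is_external = [f for i, f in enumerate(is_external) if i not in merged]
        is_external.append(False)  # the merged block is internal
    return external, total


if __name__ == "__main__":
    rng = random.Random(2024)
    n, alpha = 200, 1.5
    runs = [simulate_lengths(n, alpha, rng) for _ in range(50)]
    print(sum(ext / tot for ext, tot in runs) / len(runs))
```

Averaging $n^{\alpha-2}L^{(n)}_{ext}$ or the ratio $L^{(n)}_{ext}/L^{(n)}$ over many runs gives an empirical view of the limits discussed in this paper; convergence in $n$ may be slow, so such output should be read as indicative only.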

For several measures $\Lambda$, many asymptotic results on the external branches, their total
length, and the total length of the $\Lambda$ $n$-coalescent are already known.

(1) If $\Lambda = \delta_0$, the Dirac measure at 0, $\Pi^{(n)}$ is the Kingman $n$-coalescent. Then,

(a) $nT^{(n)}$ converges in distribution to $T$, which is a random variable with density
$f_T(x) = \frac{8}{(2+x)^3} 1_{x \ge 0}$ (see 13, [22]).

(b) $L^{(n)}_{ext}$ converges in $L^2$ to 2 (see [17], [H]). A central limit theorem is also proved
in [22].

(c) $L^{(n)}/(2\log n)$ converges in probability to 1, and a central limit theorem is also known
(see [inilll]). Hence $(\log n)L^{(n)}_{ext}/L^{(n)}$ converges in probability to 1.
(2) If $\Lambda$ is the uniform probability measure on $[0,1]$, $\Pi^{(n)}$ is the Bolthausen-Sznitman
$n$-coalescent.

(a) $(\log n)T^{(n)}$ is asymptotically standard exponentially distributed (see [16]).

(b) $\frac{\log n}{n}L^{(n)}$ converges in probability to 1 and a central limit theorem is also known
(see [EKE]).

(3) If $\mu_{-1} := \int_0^1 x^{-1}\Lambda(dx) < \infty$, which includes the case of the Beta$(2-\alpha, \alpha)$ $n$-coalescent
with $0 < \alpha < 1$,

(a) $T^{(n)}$ converges in distribution to Exp$(\mu_{-1})$ (see [18], [28]).

(b) $L^{(n)}/n$ converges in distribution to a random variable $L$ whose distribution coincides
with that of $\int_0^{\infty} e^{-X_t}\,dt$, where $X_t$ is a certain subordinator (see page 1405
in [H] and [27]).

(c) $L^{(n)}_{ext}/L^{(n)}$ converges in probability to 1 (see [28]).
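(4) If $\Lambda$ is Beta$(2-\alpha, \alpha)$ with $1 < \alpha < 2$,

(a) $n^{\alpha-1}T^{(n)}$ converges in distribution to $T$, which is a random variable with density
function $f_T(x) = \frac{1}{(\alpha-1)\Gamma(\alpha)}\left(1 + \frac{x}{\alpha\Gamma(\alpha)}\right)^{-\frac{\alpha}{\alpha-1}-1} 1_{x \ge 0}$ (see [11]).

(b) $n^{\alpha-2}L^{(n)}$ converges in probability to $\frac{\Gamma(\alpha)\alpha(\alpha-1)}{2-\alpha}$ (see [2]). A central limit theorem
is also proved in [23] (see also [10]).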
In the rest of the paper, we only consider Beta$(2-\alpha, \alpha)$ $n$-coalescents with $1 < \alpha < 2$.
From now on, we omit $(2-\alpha, \alpha)$ and $1 < \alpha < 2$, and call it the Beta $n$-coalescent, or $n$-coalescent.

The main result of the paper is the following asymptotic behavior of $L^{(n)}_{ext}$.
Theorem 1. For $1 < \alpha < 2$, the following convergence in $L^2$ holds:

$$n^{\alpha-2}L^{(n)}_{ext} \xrightarrow[n\to\infty]{} \alpha(\alpha-1)\Gamma(\alpha).$$

Using this result and the asymptotics for $L^{(n)}$, we immediately get the following.
Corollary 2. For $1 < \alpha < 2$, the following convergence in probability holds:

$$\frac{L^{(n)}_{ext}}{L^{(n)}} \xrightarrow[n\to\infty]{} 2-\alpha.$$

It shows that for Beta $n$-coalescents with $1 < \alpha < 2$, the total length of the external
branches and the total length are equivalent in the limit up to the multiplicative constant
$2-\alpha$. Recall that for the Kingman $n$-coalescent, $(\log n)L^{(n)}_{ext}/L^{(n)}$ converges in probability to 1,
and hence $L^{(n)}_{ext}/L^{(n)}$ converges in probability to 0. We know that the Beta $n$-coalescent converges
weakly to the Kingman $n$-coalescent as $\alpha$ tends to 2. This Corollary shows a nice continuity
property for $L^{(n)}_{ext}/L^{(n)}$.
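To prove Theorem 1, we first need the asymptotic behavior of the joint distribution of $d$
external branch lengths.

Theorem 3. For fixed $d \in \mathbb{N}$, and $T^{(n)}_1, T^{(n)}_2, \dots, T^{(n)}_d$ the lengths of the external branches
of individuals $1, 2, \dots, d$, we have the following convergence in distribution:

$$\left(n^{\alpha-1}T^{(n)}_1, n^{\alpha-1}T^{(n)}_2, \dots, n^{\alpha-1}T^{(n)}_d\right) \xrightarrow[n\to\infty]{} (T_1, T_2, \dots, T_d),$$

where $T_1, T_2, \dots, T_d$ are i.i.d. random variables with density function

$$f_T(x) = \frac{1}{(\alpha-1)\Gamma(\alpha)}\left(1 + \frac{x}{\alpha\Gamma(\alpha)}\right)^{-\frac{\alpha}{\alpha-1}-1} 1_{x \ge 0}.$$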
This is an extension of Theorem 5.2 of [11], which is valid only for $d = 1$. This Theorem
shows that the influence of the other selected individuals on each of the $d$ selected individuals
is negligible compared with the increasing number of other companions. In the limit, any multiple
collision involving at least two of the $d$ individuals has negligible probability. When $n$
tends to $\infty$, each of the $d$ selected individuals asymptotically coalesces with the other $n - d$
individuals, independently of the other $d - 1$ individuals.
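The limit law in Theorem 3 also explains the constant in Theorem 1 heuristically: since $L^{(n)}_{ext}$ is a sum of $n$ identically distributed lengths, one expects $n^{\alpha-2}\mathbb{E}[L^{(n)}_{ext}] = \mathbb{E}[n^{\alpha-1}T^{(n)}_1]$ to approach $\mathbb{E}[T] = \alpha(\alpha-1)\Gamma(\alpha)$. The sketch below checks this value of $\mathbb{E}[T]$ numerically from the survival function $\mathbb{P}(T > t) = (1 + t/(\alpha\Gamma(\alpha)))^{-\alpha/(\alpha-1)}$ obtained by integrating the density; the function names and the change of variable are illustrative choices.

```python
from math import gamma


def density(x, alpha):
    """Density f_T of the limit variable T, for 1 < alpha < 2."""
    if x < 0:
        return 0.0
    c = alpha * gamma(alpha)
    return (1 + x / c) ** (-alpha / (alpha - 1) - 1) / ((alpha - 1) * gamma(alpha))


def survival(t, alpha):
    """P(T > t), obtained by integrating f_T over (t, infinity)."""
    c = alpha * gamma(alpha)
    return (1 + t / c) ** (-alpha / (alpha - 1))


def mean_closed_form(alpha):
    """E[T] = alpha * (alpha - 1) * Gamma(alpha), the constant of Theorem 1."""
    return alpha * (alpha - 1) * gamma(alpha)


def mean_numeric(alpha, steps=100000):
    """Midpoint rule for E[T] = int_0^inf P(T > t) dt after t = c * u / (1 - u)."""
    c = alpha * gamma(alpha)
    h = 1.0 / steps
    total = 0.0
    for j in range(steps):
        u = (j + 0.5) * h
        t = c * u / (1 - u)
        total += survival(t, alpha) * c / (1 - u) ** 2  # Jacobian dt/du
    return total * h


if __name__ == "__main__":
    for alpha in (1.2, 1.5, 1.8):
        print(alpha, mean_numeric(alpha), mean_closed_form(alpha))
```

The numerical and closed-form means should agree up to quadrature error; the agreement is only a consistency check on the constants and does not replace the $L^2$ argument of the paper.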

To prove this, we use a recursive construction of the $\Lambda$ $n$-coalescent. This construction has
already been used in [11]. It consists in adding individuals one by one in accordance with the
consistency property. In our case, we add the $d$ chosen individuals one by one to the remaining
$n - d$ individuals. The coalescence behavior of each individual can be investigated precisely,
which leads to Theorem 3.

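The following Proposition gives a moment convergence for the normalized lengths of
external branches.

Proposition 4. Let $T$ be a random variable with density function

$$f_T(x) = \frac{1}{(\alpha-1)\Gamma(\alpha)}\left(1 + \frac{x}{\alpha\Gamma(\alpha)}\right)^{-\frac{\alpha}{\alpha-1}-1} 1_{x \ge 0}.$$

(1) Let $\beta$ be a positive real number.

(a) If $\beta < \frac{\alpha}{\alpha-1}$, then $\lim_{n\to\infty} \mathbb{E}[(n^{\alpha-1}T^{(n)}_1)^{\beta}] = \mathbb{E}[T^{\beta}]$.

(b) If $\beta \ge \frac{\alpha}{\alpha-1}$, then $\lim_{n\to\infty} \mathbb{E}[(n^{\alpha-1}T^{(n)}_1)^{\beta}] = \infty$.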

(2) For fixed (i G N, and /3i, /32, /3rf non-negative real numbers such that ft < ^^r^; 
we have 

lim E[(n°-iT-,("yi(™""^ri"V' • • • {n''-^Ti''^y^] = E[T^m[T'^^] ■ ■ ■ E[TM. 

n— >-oo 

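Notice that the first part of the Proposition covers every positive moment of $n^{\alpha-1}T^{(n)}_1$.
The value $\beta = \frac{\alpha}{\alpha-1}$ appears as a critical point. For $\beta < \frac{\alpha}{\alpha-1}$, we have $\mathbb{E}[(T^{(n)}_1)^{\beta}] \sim
\mathbb{E}[T^{\beta}]n^{(1-\alpha)\beta}$, but for $\beta \ge \frac{\alpha}{\alpha-1}$, no equivalent could be found for $\mathbb{E}[(T^{(n)}_1)^{\beta}]$. This is
a curious point.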

For the second part of the Proposition, we do not know whether Pq = is a critical 
point or not. 

Theorem 1 appears as a by-product of Theorem 3 and Proposition 4.

1.3. Organization of this paper. In Section 2, we describe the recursive construction.
This construction is suitable for any $\Lambda$ $n$-coalescent and is essential for the rest of the paper.
In Section 3, we prove Proposition 4. Theorem 1 and Corollary 2 are proved in Section 4.

2. Recursive construction of the $n$-coalescent process

Recall that if $n \ge 3$ and $1 \le i \le j \le n$, for any partition $\pi \in \mathcal{E}_{i,n}$, $\rho^{(j,n)}\pi$ denotes
the natural restriction of $\pi$ to $\{j, j+1, \dots, n\}$. Let $n \ge 3$ and $1 \le d \le n - 1$. Recall that
if $\Pi^{(n)}$ is an $n$-coalescent, then $\rho^{(d+1,n)}\Pi^{(n)}$ is an $(n-d)$-coalescent process on $\{d+1, \dots, n\}$.
In this section, we consider $\Pi^{(d+1,n)}$, a realization of the $(n-d)$-coalescent process on
$\{d+1, \dots, n\}$, and we explain how to add successively individuals $d, d-1, \dots, 1$ to $\Pi^{(d+1,n)}$ to get
$\Pi^{(d,n)}, \Pi^{(d-1,n)}, \dots, \Pi^{(1,n)} = \Pi^{(n)}$ such that $\rho^{(j,n)}\Pi^{(i,n)} = \Pi^{(j,n)}$, $1 \le i \le j \le d+1$.

The above construction is called the recursive construction. If $d = 1$, the construction is
described in [11]. We recall it for the sake of completeness.
2.1. The case d = 1. In this part, we consider $\Pi^{(2,n)}$, a coalescent on $\{2, \dots, n\}$, and we
construct a coalescent $\Pi^{(1,n)} = \Pi^{(n)}$ on $\{1, 2, \dots, n\}$ such that $\rho^{(2,n)}\Pi^{(n)} = \Pi^{(2,n)}$ by sticking
individual 1 to $\Pi^{(2,n)}$.

There are two possibilities to stick individual 1.

• Between two successive collisions of $\Pi^{(2,n)}$ (see Figure 1)

Let us call $u$ and $v$ the times of two successive collisions. We assume that individual
1 is not stuck at time $u$. For $t \in (u, v)$, the number of individuals of $\Pi^{(2,n)}_t$ (which is
the number of its blocks) is constant. We call it $b$, and denote by $i_1, i_2, \dots, i_b$
the individuals. Each of these $b$ individuals attracts individual 1 independently at
rate $\lambda_{b+1,2}$. That means that $b$ independent exponential random variables $e_1, \dots, e_b$ with
parameter $\lambda_{b+1,2}$ are attached to the individuals, and if $\tau = u + e_{k_0} = \inf\{u + e_k, 1 \le
k \le b\} < v$, then individual 1 is stuck to individual $i_{k_0}$ (for the example in Figure 1,
$i_{k_0} = \{2, 3\}$). Notice that individual 1 is coalesced at rate $b\lambda_{b+1,2}$. If it is stuck, then
we define $\Pi^{(n)}$ by

- for $t < \tau$, the partition $\Pi^{(n)}_t$ is the partition $\Pi^{(2,n)}_t$ together with the singleton $\{1\}$;

- for $t \ge \tau$, the partition $\Pi^{(n)}_t$ is obtained from the partition $\Pi^{(2,n)}_t$ by adding
individual 1 to the block which has descendant $i_{k_0}$ at time $\tau$.

In this case, individual 1 is involved in a binary collision of $\Pi^{(n)}$.

• At a collision of $\Pi^{(2,n)}$ (see Figure 2)

We consider a collision of $\Pi^{(2,n)}$, say at a time $u$. Assume that individual 1 is
not stuck before time $u$. We denote by $b = |\Pi^{(2,n)}_{u-}|$ the number of individuals just
before the collision, and by $k$ the number of individuals involved in this collision
($k = |\Pi^{(2,n)}_{u-}| - |\Pi^{(2,n)}_{u}| + 1$). Then we decide that individual 1 participates in this
collision with probability

$$\frac{\lambda_{b+1,k+1}/g_b}{\lambda_{b,k}/g_b} = \frac{\int_0^1 x^{k+1}(1-x)^{b-k}\,\nu(dx)}{\int_0^1 x^{k}(1-x)^{b-k}\,\nu(dx)} = \frac{k-\alpha}{b}.$$

This equality comes from (1). If we decide that individual 1 participates in this
collision, we set $\tau = u$ and stick 1 at this time to get $\Pi^{(n)}$, which means that $\Pi^{(n)}_t$
is defined as above on $[0, \tau)$ and $[\tau, +\infty)$. In this case, individual 1 is involved in a
multiple collision of $\Pi^{(n)}$.

Remark that the length of the external branch of individual 1 in $\Pi^{(n)}$ is $T^{(n)}_1 = \tau$. For
the rest of the individuals, in general, the length of the external branch is not changed: for
individual $2 \le j \le n$, we have $T^{(n)}_j = T_j(\Pi^{(2,n)})$, the length it had in $\Pi^{(2,n)}$. The only case in which it
changes is when individual 1 is involved in a binary collision and is stuck to an individual
$i_{k_0}$ which is a singleton $\{j\}$. In that case, $T^{(n)}_j = \tau < T_j(\Pi^{(2,n)})$. All the other external
lengths remain the same.

With this mechanism, we say that we have stuck individual 1 to $\Pi^{(2,n)}$, conditionally on
$\Pi^{(2,n)}$, to get $\Pi^{(n)}$. Of course, $\rho^{(2,n)}\Pi^{(n)} = \Pi^{(2,n)}$.
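The two ingredients of the sticking mechanism, the binary attraction rate $b\lambda_{b+1,2}$ and the participation probability $\lambda_{b+1,k+1}/\lambda_{b,k} = (k-\alpha)/b$, can be checked numerically for the Beta$(2-\alpha, \alpha)$ measure. The sketch below is only such a check, with hypothetical function names; it does not simulate the full construction.

```python
from math import lgamma, exp


def log_beta(x, y):
    """Logarithm of the Euler Beta function B(x, y), for x, y > 0."""
    return lgamma(x) + lgamma(y) - lgamma(x + y)


def lam(b, a, alpha):
    """lambda_{b,a} = B(a - alpha, b - a + alpha) / B(2 - alpha, alpha)."""
    return exp(log_beta(a - alpha, b - a + alpha) - log_beta(2 - alpha, alpha))


def binary_attach_rate(b, alpha):
    """Total rate b * lambda_{b+1,2} at which individual 1 joins one of b blocks."""
    return b * lam(b + 1, 2, alpha)


def participation_probability(b, k, alpha):
    """Probability that individual 1 joins a k-collision among b blocks."""
    return lam(b + 1, k + 1, alpha) / lam(b, k, alpha)


if __name__ == "__main__":
    alpha = 1.5
    for b in (5, 10, 20):
        for k in range(2, b + 1):
            direct = participation_probability(b, k, alpha)
            print(b, k, direct, (k - alpha) / b)
```

Both printed columns should coincide up to rounding, which is the identity used in the second case of the construction.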



2.2. The case d > 1. Using the recursive construction described in the previous case, given
$\Pi^{(d+1,n)}$, we add one by one individuals $d, d-1, \dots, 1$ to get $\Pi^{(d,n)}, \Pi^{(d-1,n)}, \dots, \Pi^{(1,n)} = \Pi^{(n)}$.
With this construction, we have $\rho^{(j,n)}\Pi^{(i,n)} = \Pi^{(j,n)}$ for all $1 \le i \le j \le d+1$.
Figure 1. n = 5. Type 1: individual 1 encounters a binary collision.

Figure 2. n = 5. Type 2: individual 1 encounters a multiple collision.

3. Proof of Theorem 3

Before proving Theorem 3, we give a technical Lemma. Recall that $\tau^{(n)}$ denotes the number
of jumps of $\Pi^{(n)}$. For $1 \le i \le \tau^{(n)}$, we introduce $\mathcal{A}^{(n)}_i$, the time at which the $i$-th jump of $\Pi^{(n)}$
occurs, and for $t \ge 0$, $\tau^{(n)}_t := \sup\{i, n^{\alpha-1}\mathcal{A}^{(n)}_i \le t\}$.

Lemma 1. Let $(a_t)_{t \ge 0}$, $(b_t)_{t \ge 0}$ be two non-negative bounded continuous time stochastic
processes and $(c_i)_{i \ge 1}$ a non-negative bounded discrete time process. Then, for fixed $0 \le t_1 < t_2$,
when $n$ tends to $\infty$:

(1) the sequence exp(— J^*^ + as)x'^{l — x)^'"^^^°'^^" ^i'{dx)ds) converges 
in probability to ; 

(2) the sequence H — converges in probability to (^w^^xT') ■ 

Proof. For the first assertion, we refer to the arguments used in Theorem 5.2 in [11]. For the 
second assertion, we refer to the arguments used in Theorem 3.1 in [11]. □ 

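We are now able to give the proof of Theorem 3. Recall that $d \in \mathbb{N}$ is fixed, and $n \ge d + 1$.
Let $\mathbb{R}^d_+ := \{(x_1, \dots, x_d) : \forall i \in \{1, \dots, d\}, x_i \ge 0\}$. By the Portmanteau Lemma (see [3], page
16), Theorem 3 is equivalent to: for any open set $\Omega_1 \subset \mathbb{R}^d_+$,

$$\liminf_{n\to\infty} \mathbb{P}\left((n^{\alpha-1}T^{(n)}_1, \dots, n^{\alpha-1}T^{(n)}_d) \in \Omega_1\right) \ge \mathbb{P}\left((T_1, \dots, T_d) \in \Omega_1\right).$$

We define $\Delta^{(d)} := \{(x_1, \dots, x_d) : 0 \le x_d < x_{d-1} < \dots < x_1\}$. Notice that $(n^{\alpha-1}T^{(n)}_1, \dots, n^{\alpha-1}T^{(n)}_d)$
and $(T_1, \dots, T_d)$ are both exchangeable. Then Theorem 3 is equivalent to: for any open set
$\Omega_2 \subset \Delta^{(d)}$,

$$\liminf_{n\to\infty} \mathbb{P}\left((n^{\alpha-1}T^{(n)}_1, \dots, n^{\alpha-1}T^{(n)}_d) \in \Omega_2\right) \ge \mathbb{P}\left((T_1, \dots, T_d) \in \Omega_2\right). \tag{3}$$

Remark that $(T_1, \dots, T_d)$ has a continuous density function with respect to Lebesgue
measure on $\Delta^{(d)}$. We denote it by $f_d$. For any measurable set $A \subset \Delta^{(d)}$, we define $Q(A) =
\int_A f_d(x)\,dx$. For any open set $\Omega_2 \subset \Delta^{(d)}$ with $Q(\Omega_2) > 0$ and $\varepsilon > 0$, we can find a finite
number of disjoint open rectangles $K_1, \dots, K_N$, $N \ge 1$, such that

$$\bigcup_{i=1}^{N} K_i \subset \Omega_2 \quad \text{and} \quad Q(\Omega_2) - \varepsilon \le \sum_{i=1}^{N} Q(K_i) \le Q(\Omega_2).$$

Hence to prove (3), it suffices to get the result for any non-empty open rectangle $\Omega_2 \subset \Delta^{(d)}$,
i.e. $\Omega_2 = ]s_1, t_1[ \times ]s_2, t_2[ \times \dots \times ]s_d, t_d[$, where $0 \le s_d < t_d < \dots < s_1 < t_1$. We shall prove
that:

$$\liminf_{n\to\infty} \mathbb{P}\left((n^{\alpha-1}T^{(n)}_1, \dots, n^{\alpha-1}T^{(n)}_d) \in \Omega_2\right) \ge \mathbb{P}\left((T_1, \dots, T_d) \in \Omega_2\right). \tag{4}$$

In fact we prove that if $\Omega_3 = ]s_1, t_1] \times ]s_2, t_2] \times \dots \times ]s_d, t_d]$, where $0 \le s_d < t_d < \dots < s_1 < t_1$, then

$$\lim_{n\to\infty} \mathbb{P}\left((n^{\alpha-1}T^{(n)}_1, \dots, n^{\alpha-1}T^{(n)}_d) \in \Omega_3\right) = \mathbb{P}\left((T_1, \dots, T_d) \in \Omega_3\right). \tag{5}$$

Equality (5) directly implies inequality (4).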

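For simplicity, we first prove the case $d = 2$, and then explain the general case.
First step: case d = 2.

We consider $\Pi^{(3,n)}$, and construct $\Pi^{(n)}$ by first adding individual 2, then adding
individual 1, as explained in the previous Section. The event $\{s_1 < n^{\alpha-1}T^{(n)}_1 \le t_1, s_2 < n^{\alpha-1}T^{(n)}_2 \le
t_2\}$ is then the event where the time at which individual 2 is stuck is within $(n^{1-\alpha}s_2, n^{1-\alpha}t_2]$,
and then individual 1 is stuck within $(n^{1-\alpha}s_1, n^{1-\alpha}t_1]$. Notice that when individual 2 is
added, its external branch length is not changed by the coalescence of individual 1, and is
equal to $T^{(n)}_2$. Let us write:

$$\mathbb{P}\left(s_1 < n^{\alpha-1}T^{(n)}_1 \le t_1, s_2 < n^{\alpha-1}T^{(n)}_2 \le t_2\right) = \mathbb{P}\left(s_2 < n^{\alpha-1}T^{(n)}_2 \le t_2\right)\mathbb{P}\left(s_1 < n^{\alpha-1}T^{(n)}_1 \le t_1 \,\middle|\, s_2 < n^{\alpha-1}T^{(n)}_2 \le t_2\right).$$

By Theorem 5.2 in [11], $n^{\alpha-1}T^{(n)}$ converges in distribution to $T$, which is a random variable
with density function $f_T(x) = \frac{1}{(\alpha-1)\Gamma(\alpha)}\left(1 + \frac{x}{\alpha\Gamma(\alpha)}\right)^{-\frac{\alpha}{\alpha-1}-1} 1_{x \ge 0}$. So we get that
$\mathbb{P}(s_2 < n^{\alpha-1}T^{(n)}_2 \le t_2)$ converges to

$$\mathbb{P}(s_2 < T \le t_2) = \left(1 + \frac{s_2}{\alpha\Gamma(\alpha)}\right)^{-\frac{\alpha}{\alpha-1}} - \left(1 + \frac{t_2}{\alpha\Gamma(\alpha)}\right)^{-\frac{\alpha}{\alpha-1}}.$$

Hence it remains to prove that, when $n$ tends to $\infty$,

$$\mathbb{P}\left(s_1 < n^{\alpha-1}T^{(n)}_1 \le t_1 \,\middle|\, s_2 < n^{\alpha-1}T^{(n)}_2 \le t_2\right) \to \left(1 + \frac{s_1}{\alpha\Gamma(\alpha)}\right)^{-\frac{\alpha}{\alpha-1}} - \left(1 + \frac{t_1}{\alpha\Gamma(\alpha)}\right)^{-\frac{\alpha}{\alpha-1}}. \tag{6}$$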

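Once individual 2 is stuck to $\Pi^{(3,n)}$, it gives $\Pi^{(2,n)}$. Since individual 1 is then stuck to $\Pi^{(2,n)}$,
we need to distinguish the different behaviors of individual 2. As stated for the case $d = 1$, there are two
possibilities for individual 2 to be stuck. The first is that individual 2 is involved in a binary
collision within $(n^{1-\alpha}s_2, n^{1-\alpha}t_2]$. The second is that individual 2 is involved in a multiple
collision of $\Pi^{(3,n)}$ within $(n^{1-\alpha}s_2, n^{1-\alpha}t_2]$.

We denote by $A^{(3,n)}_1$ the first possibility, and by $A^{(3,n)}_2$ the second possibility. Then
$\{s_2 < n^{\alpha-1}T^{(n)}_2 \le t_2\} = A^{(3,n)}_1 \cup A^{(3,n)}_2$, and

$$\mathbb{P}\left(s_1 < n^{\alpha-1}T^{(n)}_1 \le t_1 \,\middle|\, s_2 < n^{\alpha-1}T^{(n)}_2 \le t_2\right) = \frac{\mathbb{P}(A^{(3,n)}_1)}{\mathbb{P}(A^{(3,n)}_1 \cup A^{(3,n)}_2)}\mathbb{P}\left(s_1 < n^{\alpha-1}T^{(n)}_1 \le t_1 \,\middle|\, A^{(3,n)}_1\right) + \frac{\mathbb{P}(A^{(3,n)}_2)}{\mathbb{P}(A^{(3,n)}_1 \cup A^{(3,n)}_2)}\mathbb{P}\left(s_1 < n^{\alpha-1}T^{(n)}_1 \le t_1 \,\middle|\, A^{(3,n)}_2\right).$$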
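Hence to get (6) it is enough to prove that, when $n$ tends to $\infty$:

$$(L_1): \quad \mathbb{P}\left(s_1 < n^{\alpha-1}T^{(n)}_1 \le t_1 \,\middle|\, A^{(3,n)}_1\right) \to \left(1 + \frac{s_1}{\alpha\Gamma(\alpha)}\right)^{-\frac{\alpha}{\alpha-1}} - \left(1 + \frac{t_1}{\alpha\Gamma(\alpha)}\right)^{-\frac{\alpha}{\alpha-1}};$$

$$(L_2): \quad \mathbb{P}\left(s_1 < n^{\alpha-1}T^{(n)}_1 \le t_1 \,\middle|\, A^{(3,n)}_2\right) \to \left(1 + \frac{s_1}{\alpha\Gamma(\alpha)}\right)^{-\frac{\alpha}{\alpha-1}} - \left(1 + \frac{t_1}{\alpha\Gamma(\alpha)}\right)^{-\frac{\alpha}{\alpha-1}}.$$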
Proof of $(L_1)$: On the event $A^{(3,n)}_1$, we introduce $i^*, t^*$ such that individual 2 is coalesced
at time $t^*$ and $t^*$ is strictly between the $(i^*-1)$-th and $i^*$-th jump moments of $\Pi^{(3,n)}$. We denote
by $\tau^{(3,n)}$ the total number of jumps of $\Pi^{(3,n)}$. For $i \in \mathbb{N}$, we define $\mathcal{A}^{(3,n)}_i$, the time of the $i$-th jump
moment of $\Pi^{(3,n)}$ (hence $1 \le i \le \tau^{(3,n)}$), and for $t \ge 0$, $\tau^{(3,n)}_t := \sup\{i, n^{\alpha-1}\mathcal{A}^{(3,n)}_i \le t\}$.

Let $(a^{(3,n)}_t)_{t \ge 0}$ be the continuous time stochastic process such that $a^{(3,n)}_t = 1_{[0,t^*)}(t)$, and
$(b^{(3,n)}_i)_{i \ge 1}$ be the discrete time stochastic process such that $b^{(3,n)}_i = 1$ if $1 \le i \le i^* - 1$, and
$b^{(3,n)}_i = 0$ otherwise. Then



(si < n^-^r^^") < ii|Ai' 



(3,n); 



E[exp(- /''ni"°(i?2ri+af'"))A^(3,„) +^(3,„)^, ^^s) 



J3.n) 



(3,n) Y{3,n) , (3,n) -, 

j-j- -'i-i - +"i + a - i 



(l-exp(-/ ni-(i?f„'ri)A^(3,„) ^^,ds) n ^ 



(3,n) 

^(3,n) _ (3,n) ^ ^ _ ^ 



(3,n) . -, 



(3,n) 



(3,n)i 



j-1 
Recall that $\lambda_{b,a} = \int_0^1 x^{a}(1-x)^{b-a}\,\nu(dx)$ with $2 \le a \le b$. Then



P(si <n'^~'^T[^^ <ti|Ap'")) 

=E[exp(- +oi3'"))x2(l-x)^ln^-+''^''"^-V(dx)ds) 

J3.n) 



^0 



y.(3,n) + g - ygn) _ ^(3.n) ^ ^(3,n) ^ ^ _ 

+ j=l + Oj 



(l - exp(- ni--(/2f^'r2 - 



(3,n) 



ii y(3,n) jl^l J 



l = r), ' +1 ' ^ 



Notice that 



(1) exp(- f ' [ ni-°(i?^^v"2,+af'"))x2(l-x)^l^"--+''^''"'~V(dx)(is) is the probability 

Jo Jo 

for individual 1 not to have a binary collision within [0,n^"'^si]; 
y.(3,n) + a - 1 

(2) — — r is the probability that individual 1 is not involved in a collision which is 

y/r7 + 1 

due to a binary collision between individual 2 and n^'^'"^ Notice that R^"^'"^^ = R^'^'^\ 
By Theorem 4.4 in [U], y.?_:"^ = > R^^i-l^ — >-oo when n tends to oo, so the 

probability ' ^g^^^ converges to 1; 



„(3.") 



y.(_3.«) _ ^(3,n) + 5(3,") ^ ^ _ ;l 

(3) 7^ — r 7^ — r is the probability for individual 1 not to be coa- 

J.J. ^[i,nj i(,3,nj 

i=l + 

lesced in a multiple collisions of 11'^'^'") within time [0, n^~"si]; 



(3,n) 

^ti \ri3,n) -5^(3, n) 



(4) l-exp(-y y ^ni-'^ v{dx)ds) J_| 



(3,n,) 

^ i 

is the probability for individual 1 to be coalesced within (n^^^si, n^~"fi]. 

Then using $R^{(n-2)} \overset{(d)}{=} R^{(3,n)}$ and Lemma 1, we get $(L_1)$.

Proof of $(L_2)$: On the event $A^{(3,n)}_2$, we introduce $t^{**}$ and $i^{**}$ such that individual 2 is
coalesced at time $t^{**}$, which is the $i^{**}$-th jump moment of $\Pi^{(3,n)}$.

Let $(c^{(3,n)}_t)_{t \ge 0}$ be the continuous time stochastic process such that $c^{(3,n)}_t = 1_{[0,t^{**})}(t)$, and
$(d^{(3,n)}_i)_{i \ge 1}$ and $(e^{(3,n)}_i)_{i \ge 1}$ the discrete time stochastic processes such that if $1 \le i \le i^{**} - 1$,

,(3,n) (3,n) ^ • ^ i**,df ,n) ^ (3,n) ^^^^^^ -(3,n) ^ (3,n) ^ ^ otherwise. 
P(A(3.«)|A^^'"))



E[exp(- / ni-"(i?2ri + '"^)A^(3.„ 

^0 sn^' 



J3,n) 



rs^ y{3,n) _ ^(3,n) ^ ^(3,n) ^ ^ _ 
i=l 



_|_ „(3,n) 



(3.n) 

y.(3,n) _ ^(3,n) ^ ^ _ ^ 



E[exp( 

J3,n) 

n 



Jo 



{2,n) , ^(3,n) 



^(3,?i) 



u{dx)ds) 



|A 



3, 711 



1=1 



• (l - exp(- ni-"(i?f^'ri)x2(l - X 



l_„.p(3,n) _x^^1!TL-1 



u[dx)ds) 



Isi JO 

„(3,n) 

n 

{3,ri) I 



^^(3,n) _ ^(3,n) ^ ^ _ ^ 



y;. 



(3,n) 



i-l 



Compared with $(L_1)$, in this case no new collision is created by individual 2, and at the $i^{**}$-th
collision of $\Pi^{(3,n)}$, which is also the collision where individual 2 is involved, the probability

for individual 1 to be coalesced is '**~^ (3 . 

Then using $R^{(n-2)} \overset{(d)}{=} R^{(3,n)}$ and Lemma 1, we get $(L_2)$. So we get the case $d = 2$.
Second step: $d \ge 3$.

The event $\{s_d < n^{\alpha-1}T^{(n)}_d \le t_d, \dots, s_1 < n^{\alpha-1}T^{(n)}_1 \le t_1\}$ can also be interpreted as
above. Given $\Pi^{(d+1,n)}$, we first add individual $d$, which has to be stuck at a time within
$(n^{1-\alpha}s_d, n^{1-\alpha}t_d]$, then we add individual $d-1$, which has to be stuck within $(n^{1-\alpha}s_{d-1}, n^{1-\alpha}t_{d-1}]$,
etc. Since every individual is coalesced in a different interval, the external branch length of
individual $i$ when it is initially connected is exactly $T^{(n)}_i$, $1 \le i \le d$. We can then consider
$T^{(n)}_i$ as the external branch length of individual $i$ when it is initially added.
We use the decomposition

$$\mathbb{P}\left(s_d < n^{\alpha-1}T^{(n)}_d \le t_d, \dots, s_1 < n^{\alpha-1}T^{(n)}_1 \le t_1\right) = \mathbb{P}\left(s_d < n^{\alpha-1}T^{(n)}_d \le t_d\right)\mathbb{P}\left(s_{d-1} < n^{\alpha-1}T^{(n)}_{d-1} \le t_{d-1} \,\middle|\, s_d < n^{\alpha-1}T^{(n)}_d \le t_d\right) \cdots \mathbb{P}\left(s_1 < n^{\alpha-1}T^{(n)}_1 \le t_1 \,\middle|\, s_2 < n^{\alpha-1}T^{(n)}_2 \le t_2, \dots, s_d < n^{\alpha-1}T^{(n)}_d \le t_d\right).$$
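Notice that by Theorem 5.2 in [11], $\mathbb{P}(s_d < n^{\alpha-1}T^{(n)}_d \le t_d)$ converges to
$\left(1 + \frac{s_d}{\alpha\Gamma(\alpha)}\right)^{-\frac{\alpha}{\alpha-1}} - \left(1 + \frac{t_d}{\alpha\Gamma(\alpha)}\right)^{-\frac{\alpha}{\alpha-1}}$. Then to prove Theorem 3, it suffices to extend this result to the following:
for any $1 \le i \le d - 1$,

$$\mathbb{P}\left(s_i < n^{\alpha-1}T^{(n)}_i \le t_i \,\middle|\, s_{i+1} < n^{\alpha-1}T^{(n)}_{i+1} \le t_{i+1}, \dots, s_d < n^{\alpha-1}T^{(n)}_d \le t_d\right)$$

converges to $\left(1 + \frac{s_i}{\alpha\Gamma(\alpha)}\right)^{-\frac{\alpha}{\alpha-1}} - \left(1 + \frac{t_i}{\alpha\Gamma(\alpha)}\right)^{-\frac{\alpha}{\alpha-1}}$. The idea is the same as in the case $d = 2$.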

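We split the event $\{s_{i+1} < n^{\alpha-1}T^{(n)}_{i+1} \le t_{i+1}, \dots, s_d < n^{\alpha-1}T^{(n)}_d \le t_d\}$ into several classes.
In each class, it is specified for every individual among $i+1, \dots, d$ whether it encounters a binary
collision or is involved in a multiple collision of $\Pi^{(d+1,n)}$. It is easy to see that there are $2^{d-i}$
disjoint classes, which we denote by $A_1, \dots, A_{2^{d-i}}$:

$$\{s_{i+1} < n^{\alpha-1}T^{(n)}_{i+1} \le t_{i+1}, \dots, s_d < n^{\alpha-1}T^{(n)}_d \le t_d\} = \bigcup_{p=1}^{2^{d-i}} A_p.$$

Then we have

$$\mathbb{P}\left(s_i < n^{\alpha-1}T^{(n)}_i \le t_i \,\middle|\, \bigcup_{p=1}^{2^{d-i}} A_p\right) = \frac{\sum_{p=1}^{2^{d-i}} \mathbb{P}\left(s_i < n^{\alpha-1}T^{(n)}_i \le t_i, A_p\right)}{\mathbb{P}\left(\bigcup_{p=1}^{2^{d-i}} A_p\right)} = \frac{\sum_{p=1}^{2^{d-i}} \mathbb{P}(A_p)\mathbb{P}\left(s_i < n^{\alpha-1}T^{(n)}_i \le t_i \,\middle|\, A_p\right)}{\mathbb{P}\left(\bigcup_{p=1}^{2^{d-i}} A_p\right)}.$$

For each $\mathbb{P}(s_i < n^{\alpha-1}T^{(n)}_i \le t_i \mid A_p)$, $1 \le p \le 2^{d-i}$, the method is the same as in the case $d = 2$, and the
details are left to the reader. Together with Lemma 1, this gives equality (5), hence Theorem 3.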

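4. Proofs of Proposition 4
In this Section, we give the proof of Proposition 4. First of all, we give two technical lemmas.

Lemma 2. Let $(X_i)_{i \ge 1}$, $X$ be non-negative random variables such that $(X_i)_{i \ge 1}$ converges in
distribution to $X$. Assume that there exist $\beta > 1$ and $C > 0$ such that for all $i \ge 1$,
$\mathbb{E}[X_i^{\beta}] \le C$. Then $\mathbb{E}[X_i]$ converges to $\mathbb{E}[X]$.

Proof. Thanks to Skorokhod's representation theorem (see [3], page 70), we can construct
random variables $(Y_i)_{i \ge 1}$, $Y$ such that for any $i \in \mathbb{N}$, $Y_i \overset{(d)}{=} X_i$, $Y \overset{(d)}{=} X$, and $(Y_i)_{i \ge 1}$ converges
almost surely to $Y$. By Fatou's Lemma, $\mathbb{E}[X^{\beta}] = \mathbb{E}[Y^{\beta}] \le \liminf_{i\to\infty} \mathbb{E}[Y_i^{\beta}] \le C$.
We use the decomposition: $Y_i = Y_i 1_{\{Y_i \le Y+1\}} + Y_i 1_{\{Y_i > Y+1\}}$.

For the first part, $\mathbb{E}[Y_i 1_{\{Y_i \le Y+1\}}]$ converges to $\mathbb{E}[Y]$ by the dominated convergence theorem.
For the second part, using Hölder's inequality, we have

$$\mathbb{E}[Y_i 1_{\{Y_i > Y+1\}}] \le \left(\mathbb{E}[Y_i^{\beta}]\right)^{\frac{1}{\beta}}\left(\mathbb{E}[1_{\{Y_i > Y+1\}}]\right)^{1-\frac{1}{\beta}} = \left(\mathbb{E}[Y_i^{\beta}]\right)^{\frac{1}{\beta}}\left(\mathbb{P}(Y_i > Y+1)\right)^{1-\frac{1}{\beta}} \le C^{\frac{1}{\beta}}\left(\mathbb{P}(Y_i > Y+1)\right)^{1-\frac{1}{\beta}},$$

which converges to 0, since $Y_i$ converges almost surely to $Y$.

Then $\mathbb{E}[Y_i]$ converges to $\mathbb{E}[Y]$. This completes the proof. □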
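Lemma 3. Let $a > 0$, $b > 0$, $\beta \ge 2$. Then

$$0 < (a+b)^{\beta} \le a^{\beta} + b^{\beta} + \beta 2^{\beta-1}ab^{\beta-1} + \beta 2^{\beta-1}a^{\beta-1}b.$$

Proof. If $0 \le m \le 1$, then

$$(1+m)^{\beta} \le 1 + \beta 2^{\beta-1}m \le 1 + m^{\beta} + \beta 2^{\beta-1}m + \beta 2^{\beta-1}m^{\beta-1}.$$

We have used that the function $m \mapsto (1+m)^{\beta}$ is convex and that $\beta 2^{\beta-1}$ is the derivative of
$(1+m)^{\beta}$ at $m = 1$.
If $1 \le m$, then

$$(1+m)^{\beta} = m^{\beta}\left(1 + \frac{1}{m}\right)^{\beta} \le m^{\beta}\left(1 + \beta 2^{\beta-1}\frac{1}{m}\right) \le 1 + m^{\beta} + \beta 2^{\beta-1}m + \beta 2^{\beta-1}m^{\beta-1}.$$

Hence for all $m \ge 0$,

$$(1+m)^{\beta} \le 1 + m^{\beta} + \beta 2^{\beta-1}m + \beta 2^{\beta-1}m^{\beta-1}.$$

Then for all $a > 0$, $b > 0$,

$$(a+b)^{\beta} = a^{\beta}\left(1 + \frac{b}{a}\right)^{\beta} \le a^{\beta}\left(1 + \left(\frac{b}{a}\right)^{\beta} + \beta 2^{\beta-1}\frac{b}{a} + \beta 2^{\beta-1}\left(\frac{b}{a}\right)^{\beta-1}\right) = a^{\beta} + b^{\beta} + \beta 2^{\beta-1}a^{\beta-1}b + \beta 2^{\beta-1}ab^{\beta-1}.$$

This completes the proof. □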
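The inequality of Lemma 3 is elementary, but a quick numerical scan helps to confirm that the reconstructed right-hand side has the intended form. The following sketch evaluates both sides on a grid; the function names and grid values are arbitrary illustrative choices.

```python
def lemma3_rhs(a, b, beta):
    """Right-hand side a^beta + b^beta + beta*2^(beta-1)*(a*b^(beta-1) + a^(beta-1)*b)."""
    c = beta * 2 ** (beta - 1)
    return a ** beta + b ** beta + c * a * b ** (beta - 1) + c * a ** (beta - 1) * b


def lemma3_holds(a, b, beta):
    """Check 0 < (a + b)^beta <= right-hand side for a, b > 0 and beta >= 2."""
    return 0 < (a + b) ** beta <= lemma3_rhs(a, b, beta)


def scan(values, betas):
    """Return the list of grid points (a, b, beta) where the inequality fails."""
    return [(a, b, beta)
            for beta in betas
            for a in values
            for b in values
            if not lemma3_holds(a, b, beta)]


if __name__ == "__main__":
    grid = [0.01, 0.1, 0.5, 1.0, 2.0, 10.0, 100.0]
    print(scan(grid, [2.0, 2.5, 3.0, 4.7]))
```

An empty list is expected, since the lemma holds for every $a, b > 0$ and $\beta \ge 2$; the constant $\beta 2^{\beta-1}$ is far from optimal, which is harmless for the moment bounds below.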
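We are now able to prove Proposition 4. We first prove the one-dimensional result.

Proof of the first part of Proposition 4.

Recall that $T$ is a random variable with density function

$$f_T(x) = \frac{1}{(\alpha-1)\Gamma(\alpha)}\left(1 + \frac{x}{\alpha\Gamma(\alpha)}\right)^{-\frac{\alpha}{\alpha-1}-1} 1_{x \ge 0}.$$

We prove here that, for $\beta$ a positive real number:

(1) If $\beta < \frac{\alpha}{\alpha-1}$, then $\lim_{n\to\infty} \mathbb{E}[(n^{\alpha-1}T^{(n)}_1)^{\beta}] = \mathbb{E}[T^{\beta}]$.

(2) If $\beta \ge \frac{\alpha}{\alpha-1}$, then $\lim_{n\to\infty} \mathbb{E}[(n^{\alpha-1}T^{(n)}_1)^{\beta}] = \infty$.

In the case $\beta \ge \frac{\alpha}{\alpha-1}$, one gets $\mathbb{E}[T^{\beta}] = \infty$. Theorem 3 tells us that $(n^{\alpha-1}T^{(n)}_1)^{\beta}$ converges in
distribution to $T^{\beta}$, so we deduce that $\mathbb{E}[(n^{\alpha-1}T^{(n)}_1)^{\beta}]$ converges to $\infty$.

If $0 < \beta < \frac{\alpha}{\alpha-1}$, according to Lemma 2 it suffices to prove that $\mathbb{E}[(n^{\alpha-1}T^{(n)}_1)^{\beta'}]$ is bounded
in $n$ for some $\beta' > \beta$. Notice that given $0 < \beta_1 < \beta_2$, we have $\mathbb{E}[(n^{\alpha-1}T^{(n)}_1)^{\beta_1}] \le \left(\mathbb{E}[(n^{\alpha-1}T^{(n)}_1)^{\beta_2}]\right)^{\beta_1/\beta_2}$, which
means that, if $\mathbb{E}[(n^{\alpha-1}T^{(n)}_1)^{\beta_2}]$ is bounded for all $n$, the same holds for $\mathbb{E}[(n^{\alpha-1}T^{(n)}_1)^{\beta_1}]$. So we
deduce that it suffices to consider $2 \le \beta < \frac{\alpha}{\alpha-1}$. We will first prove that $(\mathbb{E}[n^{\alpha-1}T^{(n)}_1], n \ge 2)$
is bounded, and then that $(\mathbb{E}[(n^{\alpha-1}T^{(n)}_1)^{\beta}], n \ge 2)$ is bounded.
Step 1: $(\mathbb{E}[n^{\alpha-1}T^{(n)}_1], n \ge 2)$ is bounded.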

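First of all, we remark that for fixed $n \ge 2$, we have $\mathbb{E}[n^{\alpha-1}T^{(n)}_1] < \infty$. We prove by
induction on $n$ that if $C_1 > 0$ is a constant large enough, then for all $n \ge 2$, we have
$\mathbb{E}[n^{\alpha-1}T^{(n)}_1] \le C_1$.

Let $n \ge 3$. We assume that there exists $C_1 > 0$ such that for all $2 \le k \le n - 1$, we have
$\mathbb{E}[k^{\alpha-1}T^{(k)}_1] \le C_1$. Writing the decomposition of $T^{(n)}_1$ at the first collision, we have

$$n^{\alpha-1}T^{(n)}_1 = n^{\alpha-1}\left(\frac{e_0}{g_n} + \sum_{k=2}^{n-1} 1_{\{H_k\}}\tilde{T}^{(k)}_1\right),$$

where $H_k :=$ {we have $k$ individuals after the first coalescence, and individual 1 is not involved
in this collision}, $e_0$ is a unit exponential random variable, $\tilde{T}^{(k)}_1 \overset{(d)}{=} T^{(k)}_1$, and all these random
variables $e_0$, $\tilde{T}^{(k)}_1$, $1_{\{H_k\}}$ are independent. We have

$$\mathbb{E}[n^{\alpha-1}T^{(n)}_1] = \mathbb{E}\left[n^{\alpha-1}\left(\frac{e_0}{g_n} + \sum_{k=2}^{n-1} 1_{\{H_k\}}\tilde{T}^{(k)}_1\right)\right] = n^{\alpha-1}\mathbb{E}\left[\frac{e_0}{g_n}\right] + n^{\alpha-1}\sum_{k=2}^{n-1}\mathbb{P}(H_k)\mathbb{E}[T^{(k)}_1] \le \frac{n^{\alpha-1}}{g_n} + C_1\sum_{k=2}^{n-1}\mathbb{P}(H_k)\left(\frac{n}{k}\right)^{\alpha-1}.$$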


Remark that n — k + 1 given individuals among n coalesce at rate Xn^n-k+i- Hence 



9n 9n 



We then have 
9n 3n k 



9n Qn \k-l k{k-l) 



9n \ Qn 

+ 1 gn j 

•1 - xf-^u{dx)' 



9n 



For a technical purpose, we define u*{dx) := j^i'{dx), II^"''*) the n-coalescent process 
associated with i^*{dx), and = Yla=2 O^'^i'^ ~ x)'^~''u*{dx). We have 

Qn 

Jo Ek=2 - X^j^^uidx) 

gn 



JoEU{Ty'-'^Hi-x)'u*idx) 
gn 

g:+l - X),.*idx) + 



Using Lemma 2.2 of [T^, we have gn = 0^(0)''^°' ~^ 0(n°~^). Moreover, while ^^^^ = 
r(a)r(2-a) + 0(3;"") when x tends to 0, then the n-coalescent II^"'*) also satisfies the 

asumptions of Lemma 2.2 of [10]. Hence we get = ^p^^^^ n" + 0(n"~^). Using again 

that ^ = rwrW^) ^"'"" + io' rt>"(l " ^>*idx) = 0{7i~') and 

/ x"~^^h'*{dx) = 0{n~^), and hence 
Jo 



(7) lim 



We have 
nn'^'^Tf'] < + Ci 1 Jo V k J ^ 1 ^ — i_ 

9n y n + 1 Qn 



^ n\gn ^ n + 1 Qn 



n 

Using d?]) and that — is bounded, we get that there exists Cq > such that, if Ci > Co, 

9n 



Gi — — < U 

9n n + l gn 



for all $n \ge 2$. Hence, if $C_1$ is also greater than $\mathbb{E}[2^{\alpha-1}T^{(2)}_1]$, we have proved by induction that
$(\mathbb{E}[n^{\alpha-1}T^{(n)}_1], n \ge 2)$ is bounded by $C_1$.

Remark 5. In the above proof, we get that $\sum_{k=2}^{n-1}\mathbb{P}(H_k)\frac{n}{k} = 1 + O(n^{-1})$. This estimate is
useful for later use.
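The estimate of Remark 5 can be checked numerically. The following is only an illustrative sketch, not part of the proof: it assumes the Beta$(2-\alpha,\alpha)$ rates $\lambda_{n,j} = B(j-\alpha, n-j+\alpha)/B(2-\alpha,\alpha)$ and the formula $\mathbb{P}(H_k) = \binom{n-1}{n-k+1}\lambda_{n,n-k+1}/g_n$; the function names are hypothetical.

```python
from math import exp, lgamma


def log_lambda(n, j, a):
    """log of lambda_{n,j} = B(j - a, n - j + a) / B(2 - a, a) (assumed Beta(2 - a, a) rates)."""
    return lgamma(j - a) + lgamma(n - j + a) - lgamma(n) - lgamma(2 - a) - lgamma(a)


def log_binom(n, j):
    """log of the binomial coefficient C(n, j)."""
    return lgamma(n + 1) - lgamma(j + 1) - lgamma(n - j + 1)


def total_rate(n, a):
    """g_n = sum_{j=2}^{n} C(n, j) * lambda_{n,j}."""
    return sum(exp(log_binom(n, j) + log_lambda(n, j, a)) for j in range(2, n + 1))


def remark5_sum(n, a):
    """sum_{k=2}^{n-1} P(H_k) * n / k."""
    g = total_rate(n, a)
    total = 0.0
    for k in range(2, n):
        j = n - k + 1  # number of individuals merging when k individuals remain
        total += exp(log_binom(n - 1, j) + log_lambda(n, j, a)) / g * n / k
    return total


if __name__ == "__main__":
    for n in (10, 100, 1000):
        s = remark5_sum(n, 1.5)
        print(n, s, n * (1 - s))  # n * (1 - s) staying bounded illustrates 1 + O(1/n)
```

For $n = 3$ and $\alpha = 3/2$ only $H_2$ contributes, with $\mathbb{P}(H_2) = 0.3$, so the sum equals $0.45$.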

Step 2: $(\mathbb{E}[(n^{\alpha-1}T_1^{(n)})^{\beta}], n \ge 2)$ is bounded.

In this step, we employ the method used previously. First of all, we remark that for fixed
$n \ge 2$, we have $\mathbb{E}[(n^{\alpha-1}T_1^{(n)})^{\beta}] < \infty$. Let $n \ge 3$. We assume that there exists $C_2 > 0$ such
that for all $2 \le k \le n-1$, we have $\mathbb{E}[(k^{\alpha-1}T_1^{(k)})^{\beta}] < C_2$. With the notations of the first
step, and using Lemma Owe have 

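$$\begin{aligned}
\mathbb{E}[(n^{\alpha-1}T_1^{(n)})^{\beta}] &= n^{(\alpha-1)\beta}\,\mathbb{E}\Big[\Big(\frac{e_0}{g_n} + \sum_{k=2}^{n-1}\mathbf{1}_{\{H_k\}}\tilde T_1^{(k)}\Big)^{\beta}\Big]\\
&\le n^{(\alpha-1)\beta}\,\mathbb{E}\Big[\Big(\frac{e_0}{g_n}\Big)^{\beta}\Big] + n^{(\alpha-1)\beta}\,\mathbb{E}\Big[\Big(\sum_{k=2}^{n-1}\mathbf{1}_{\{H_k\}}\tilde T_1^{(k)}\Big)^{\beta}\Big]\\
&\quad + n^{(\alpha-1)\beta}\,\mathbb{E}\Big[\beta 2^{\beta-1}\frac{e_0}{g_n}\Big(\sum_{k=2}^{n-1}\mathbf{1}_{\{H_k\}}\tilde T_1^{(k)}\Big)^{\beta-1}\Big] + n^{(\alpha-1)\beta}\,\mathbb{E}\Big[\beta 2^{\beta-1}\Big(\frac{e_0}{g_n}\Big)^{\beta-1}\sum_{k=2}^{n-1}\mathbf{1}_{\{H_k\}}\tilde T_1^{(k)}\Big]\\
&= I_{n,1} + I_{n,2} + I_{n,3} + I_{n,4},
\end{aligned} \tag{8}$$

where

$$\begin{aligned}
I_{n,1} &= n^{(\alpha-1)\beta}\,\mathbb{E}\Big[\Big(\frac{e_0}{g_n}\Big)^{\beta}\Big], &
I_{n,2} &= n^{(\alpha-1)\beta}\,\mathbb{E}\Big[\Big(\sum_{k=2}^{n-1}\mathbf{1}_{\{H_k\}}\tilde T_1^{(k)}\Big)^{\beta}\Big],\\
I_{n,3} &= n^{(\alpha-1)\beta}\,\mathbb{E}\Big[\beta 2^{\beta-1}\frac{e_0}{g_n}\Big(\sum_{k=2}^{n-1}\mathbf{1}_{\{H_k\}}\tilde T_1^{(k)}\Big)^{\beta-1}\Big], &
I_{n,4} &= n^{(\alpha-1)\beta}\,\mathbb{E}\Big[\beta 2^{\beta-1}\Big(\frac{e_0}{g_n}\Big)^{\beta-1}\sum_{k=2}^{n-1}\mathbf{1}_{\{H_k\}}\tilde T_1^{(k)}\Big].
\end{aligned}$$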

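Recall that $g_n \sim \frac{1}{\alpha\Gamma(\alpha)}n^{\alpha}$. Hence there exists a constant $K_1 > 0$ such that for all $n \ge 2$,

$$I_{n,1} \le \frac{K_1}{n^{\beta}} \le \frac{K_1}{n}, \tag{9}$$

because $\beta \ge 2 > 1$.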
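To illustrate the asymptotics of $g_n$ and the resulting bound on $I_{n,1}$, here is a minimal numerical sketch. It assumes the Beta$(2-\alpha,\alpha)$ rates $\lambda_{n,j} = B(j-\alpha, n-j+\alpha)/B(2-\alpha,\alpha)$ and uses $\mathbb{E}[e_0^{\beta}] = \Gamma(\beta+1)$ for a unit exponential variable; the function names are hypothetical.

```python
from math import exp, gamma, lgamma


def total_rate(n, a):
    """g_n = sum_{j=2}^{n} C(n, j) * lambda_{n,j} for the (assumed) Beta(2 - a, a) rates."""
    total = 0.0
    for j in range(2, n + 1):
        log_binom = lgamma(n + 1) - lgamma(j + 1) - lgamma(n - j + 1)
        log_lam = lgamma(j - a) + lgamma(n - j + a) - lgamma(n) - lgamma(2 - a) - lgamma(a)
        total += exp(log_binom + log_lam)
    return total


def leading_term(n, a):
    """Leading term n^a / (a * Gamma(a)) of g_n."""
    return n ** a / (a * gamma(a))


def i_n1(n, a, beta):
    """I_{n,1} = n^{(a-1) beta} * E[e_0^beta] / g_n^beta, with E[e_0^beta] = Gamma(beta + 1)."""
    return n ** ((a - 1) * beta) * gamma(beta + 1) / total_rate(n, a) ** beta


if __name__ == "__main__":
    a, beta = 1.5, 2.0
    for n in (10, 100, 1000):
        # first ratio tends to 1; n^beta * I_{n,1} stays bounded
        print(n, total_rate(n, a) / leading_term(n, a), n ** beta * i_n1(n, a, beta))
```

In this sketch $n^{\beta}I_{n,1}$ approaches $\Gamma(\beta+1)(\alpha\Gamma(\alpha))^{\beta}$, which is consistent with a bound of the form $K_1/n^{\beta}$.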

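To bound $I_{n,2}$, we introduce $H_0$, the event that individual 1 is involved in the first collision.
Remark that $H_0 = (\bigcup_{k=2}^{n-1}H_k)^c$. Recall that $Y_1^{(n)}$ is the number of individuals right after the
first coalescence and $X_1^{(n)} = n - Y_1^{(n)}$ is the number of individuals lost during the first coalescence.
Then

$$\begin{aligned}
\mathbb{P}(H_0) &= 1 - \mathbb{P}(\text{individual 1 is not coalesced at the first coalescence})\\
&= 1 - \mathbb{E}\big[\mathbb{P}(\text{individual 1 is not coalesced at the first coalescence} \mid X_1^{(n)})\big]\\
&= 1 - \mathbb{E}\Big[\binom{n-1}{X_1^{(n)}+1}\Big/\binom{n}{X_1^{(n)}+1}\Big]\\
&= \mathbb{E}\Big[\frac{X_1^{(n)}+1}{n}\Big].
\end{aligned}$$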

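Recall the following properties of $X_1^{(n)}$ (see [10], page 1003). The law of $X_1^{(n)}$ is given by: if
$\rho(t) = \nu([t,1])$, for any $1 \le k \le n-1$,

$$\mathbb{P}(X_1^{(n)} \ge k) = \frac{(n-2)!}{(n-k-1)!\,k!}\,\frac{\int_0^1 (1-t)^{n-k-1}t^{k}\rho(t)\,dt}{\int_0^1 (1-t)^{n-2}\,t\,\rho(t)\,dt}. \tag{10}$$

When $n$ tends to $\infty$, its first two moments satisfy $\mathbb{E}[X_1^{(n)}] = \frac{1}{\alpha-1} + O(n^{1-\alpha})$ (see Lemma 2.3
in [10]) and $n^{\alpha-2}\,\mathbb{E}[(X_1^{(n)})^2]$ is bounded (see Lemma 2.4 in [10]).

Hence we get that $\mathbb{P}(H_0) = \frac{\alpha}{n(\alpha-1)} + O(n^{-\alpha})$.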
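The asymptotics of $\mathbb{P}(H_0)$ can be illustrated by summing the first-collision law directly. This is only a sketch under the assumption of Beta$(2-\alpha,\alpha)$ rates $\lambda_{n,j} = B(j-\alpha, n-j+\alpha)/B(2-\alpha,\alpha)$; it uses $\mathbb{P}(H_0) = \mathbb{E}[(X_1^{(n)}+1)/n]$, where $X_1^{(n)}+1$ is the number of individuals merging, and the function names are hypothetical.

```python
from math import exp, lgamma


def merge_probabilities(n, a):
    """Return {j: P(exactly j individuals merge at the first collision)} for j = 2..n."""
    weights = {}
    for j in range(2, n + 1):
        log_binom = lgamma(n + 1) - lgamma(j + 1) - lgamma(n - j + 1)
        log_lam = lgamma(j - a) + lgamma(n - j + a) - lgamma(n) - lgamma(2 - a) - lgamma(a)
        weights[j] = exp(log_binom + log_lam)
    g = sum(weights.values())  # total collision rate g_n
    return {j: w / g for j, w in weights.items()}


def prob_h0(n, a):
    """P(H_0) = E[(X_1 + 1) / n]: given that j individuals merge, individual 1 is among them w.p. j / n."""
    return sum(p * j / n for j, p in merge_probabilities(n, a).items())


if __name__ == "__main__":
    a = 1.5
    for n in (10, 100, 1000):
        print(n, n * prob_h0(n, a), a / (a - 1))  # n * P(H_0) approaches a / (a - 1) slowly
```

The convergence is slow because the error term in $\mathbb{E}[X_1^{(n)}]$ is only $O(n^{1-\alpha})$.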

(n) 

Recall that a\ is the number of collisions which occur until individual 1 is concerned. 
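Using the asymptotics of $\mathbb{P}(H_0)$ and of the two first moments of $X_1^{(n)}$, we have:

$$\mathbb{E}\Big[\frac{X_1^{(n)}}{n}\,\Big|\,(H_0)^c\Big] = \frac{\mathbb{E}\big[\frac{X_1^{(n)}}{n}\mathbf{1}_{(H_0)^c}\big]}{\mathbb{P}((H_0)^c)} \le \frac{\mathbb{E}\big[\frac{X_1^{(n)}}{n}\big]}{\mathbb{P}((H_0)^c)} = \frac{1}{n(\alpha-1)} + O(n^{-\alpha}) \tag{11}$$

and

$$\mathbb{E}\Big[\Big(\frac{X_1^{(n)}}{n}\Big)^2\,\Big|\,(H_0)^c\Big] = \frac{\mathbb{E}\big[\big(\frac{X_1^{(n)}}{n}\big)^2\mathbf{1}_{(H_0)^c}\big]}{\mathbb{P}((H_0)^c)} \le \frac{\mathbb{E}\big[\big(\frac{X_1^{(n)}}{n}\big)^2\big]}{\mathbb{P}((H_0)^c)} = O(n^{-\alpha}). \tag{12}$$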

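Let us fix $t \in (0,1)$. Recall that $\Pi^{(n,*)}$ is the $n$-coalescent process associated with $\nu^*(dx) = \frac{1}{1-x}\nu(dx)$, and define $X_1^{(n,*)}$ as the number of individuals we lose at the first jump, and $\rho^*(s) = \nu^*([s,1])$ for any $s \in (0,1]$. Using the assumption on $\mathbb{E}[(T_1^{(k)})^{\beta}]$ and the fact that the events $H_k$ are disjoint, we have

$$\begin{aligned}
I_{n,2} &= n^{(\alpha-1)\beta}\,\mathbb{E}\Big[\Big(\sum_{k=2}^{n-1}\mathbf{1}_{\{H_k\}}\tilde T_1^{(k)}\Big)^{\beta}\Big]\\
&= n^{(\alpha-1)\beta}\sum_{k=2}^{n-1}\mathbb{P}(H_k)\,\mathbb{E}[(T_1^{(k)})^{\beta}] \le C_2\sum_{k=2}^{n-1}\mathbb{P}(H_k)\Big(\frac{n}{k}\Big)^{(\alpha-1)\beta}\\
&= C_2\sum_{k=2}^{\lfloor nt\rfloor}\mathbb{P}(H_k)\Big(\frac{n}{k}\Big)^{(\alpha-1)\beta} + C_2\sum_{k=\lfloor nt\rfloor+1}^{n-1}\mathbb{P}(H_k)\Big(\frac{n}{k}\Big)^{(\alpha-1)\beta}\\
&= C_2\sum_{k=2}^{\lfloor nt\rfloor}\mathbb{P}(H_k)\Big(\frac{n}{k}\Big)^{(\alpha-1)\beta} + C_2\,\mathbb{P}((H_0)^c)\,\mathbb{E}\Big[\Big(\frac{n}{Y_1^{(n)}}\Big)^{(\alpha-1)\beta}\mathbf{1}_{\{1\le X_1^{(n)}\le n-\lfloor nt\rfloor-1\}}\,\Big|\,(H_0)^c\Big].
\end{aligned}$$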

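We study these two terms one by one. Since $(\alpha-1)\beta < \alpha < 2$ and $n/k \ge 1$,

$$\begin{aligned}
\sum_{k=2}^{\lfloor nt\rfloor}\mathbb{P}(H_k)\Big(\frac{n}{k}\Big)^{(\alpha-1)\beta} &\le \sum_{k=2}^{\lfloor nt\rfloor}\mathbb{P}(H_k)\Big(\frac{n}{k}\Big)^{2}\\
&= \sum_{k=2}^{\lfloor nt\rfloor}\frac{n}{n+1}\,\frac{k-1}{k}\binom{n+1}{k}\frac{\int_0^1 x^{n+1-k}(1-x)^{k}\nu^*(dx)}{g_n}\\
&\le \frac{1}{g_n}\sum_{k=2}^{\lfloor nt\rfloor}\binom{n+1}{k}\int_0^1 x^{n+1-k}(1-x)^{k}\nu^*(dx)\\
&\le \frac{g^*_{n+1}}{g_n}\,\mathbb{P}(X_1^{(n+1,*)} \ge n - \lfloor nt\rfloor).
\end{aligned}$$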



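Using (10) for $X_1^{(n+1,*)}$ and $k = n - \lfloor nt\rfloor$, we have

$$\mathbb{P}(X_1^{(n+1,*)} \ge n - \lfloor nt\rfloor) = \frac{(n-1)!}{(n-\lfloor nt\rfloor)!\,(\lfloor nt\rfloor)!}\,\frac{\int_0^1 (1-s)^{\lfloor nt\rfloor}s^{n-\lfloor nt\rfloor}\rho^*(s)\,ds}{\int_0^1 (1-s)^{n-1}\,s\,\rho^*(s)\,ds}.$$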

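Recall that $\frac{\nu^*(ds)}{ds} = \frac{1}{\Gamma(2-\alpha)\Gamma(\alpha)}s^{-\alpha-1} + O(s^{-\alpha})$ when $s$ tends to 0. Hence $\rho^*(s) = \nu^*([s,1]) = \frac{1}{\alpha\Gamma(\alpha)\Gamma(2-\alpha)}s^{-\alpha} + O(s^{1-\alpha})$. Moreover $\Gamma(z) = \sqrt{2\pi}\,z^{z-1/2}e^{-z}\big(1 + \frac{1}{12z} + o(\frac{1}{z})\big)$ when $z$ tends to $\infty$.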
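Hence we see that there exist two positive constants $K_{2,1}$ and $K_{2,2}$ (independent of $n$, but
possibly depending on $t$) such that

$$\begin{aligned}
\mathbb{P}(X_1^{(n+1,*)} \ge n - \lfloor nt\rfloor) &\le K_{2,1}\,\frac{(n-1)!}{(n-\lfloor nt\rfloor)!\,(\lfloor nt\rfloor)!}\,\frac{\int_0^1 (1-s)^{\lfloor nt\rfloor}s^{n-\lfloor nt\rfloor}s^{-\alpha}\,ds}{\int_0^1 (1-s)^{n-1}\,s\,s^{-\alpha}\,ds}\\
&= K_{2,1}\,\frac{(n-1)!}{(n-\lfloor nt\rfloor)!\,(\lfloor nt\rfloor)!}\,\frac{\Gamma(\lfloor nt\rfloor+1)\,\Gamma(n-\lfloor nt\rfloor-\alpha+1)}{\Gamma(n)\,\Gamma(2-\alpha)}\\
&\le K_{2,2}\,n^{-\alpha}.
\end{aligned}$$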
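The last inequality rests on the ratio of Gamma functions behaving like $n^{-\alpha}$. A minimal numerical sketch of this step follows (the function names are hypothetical). It evaluates the product of the factorial prefactor and the Gamma ratio, which simplifies to $\Gamma(n-\lfloor nt\rfloor-\alpha+1)/(\Gamma(n-\lfloor nt\rfloor+1)\Gamma(2-\alpha))$, and compares $n^{\alpha}$ times this quantity with $(1-t)^{-\alpha}/\Gamma(2-\alpha)$, the limit suggested by Stirling's formula.

```python
from math import exp, floor, gamma, lgamma


def tail_factor(n, t, a):
    """(n-1)! / ((n-m)! m!) * Gamma(m+1) Gamma(n-m-a+1) / (Gamma(n) Gamma(2-a)), with m = floor(n t)."""
    m = floor(n * t)
    log_prefactor = lgamma(n) - lgamma(n - m + 1) - lgamma(m + 1)
    log_gamma_ratio = lgamma(m + 1) + lgamma(n - m - a + 1) - lgamma(n) - lgamma(2 - a)
    return exp(log_prefactor + log_gamma_ratio)


def rescaled_limit(t, a):
    """Limit of n^a * tail_factor(n, t, a) as n grows, obtained from Gamma(z - a) / Gamma(z) ~ z^{-a}."""
    return (1 - t) ** (-a) / gamma(2 - a)


if __name__ == "__main__":
    a, t = 1.5, 0.5
    for n in (10 ** 2, 10 ** 4, 10 ** 6):
        print(n, n ** a * tail_factor(n, t, a), rescaled_limit(t, a))
```

The limit depends on $t$ through $(1-t)^{-\alpha}$, which is why the constants $K_{2,1}$ and $K_{2,2}$ may depend on $t$.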

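Moreover, since $\lim_{n\to\infty}\frac{g^*_{n+1}}{g_n} = 1$, there exists a constant $K_{2,3} > 0$ (independent of $n$, but
possibly depending on $t$), such that

$$\sum_{k=2}^{\lfloor nt\rfloor}\mathbb{P}(H_k)\Big(\frac{n}{k}\Big)^{(\alpha-1)\beta} \le K_{2,3}\,n^{-\alpha}. \tag{13}$$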

Let us now consider the second term. There exists a constant $K_{2,4} > 0$ such that if
$u \in (0, 1-t)$, then

$$(1-u)^{-(\alpha-1)\beta} \le 1 + (\alpha-1)\beta u + K_{2,4}u^2.$$

$K_{2,4}$ depends on $t$.
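One admissible choice of the constant can be made explicit. Writing $p = (\alpha-1)\beta$, the function $u \mapsto ((1-u)^{-p} - 1 - pu)/u^2$ is increasing on $(0,1)$ because all Taylor coefficients of $(1-u)^{-p}$ are positive, so its value at $u = 1-t$ works. The sketch below checks this on a grid; it is an illustration with hypothetical function names, not the constant used by the authors.

```python
def quadratic_constant(p, t):
    """Value of ((1 - u)^(-p) - 1 - p u) / u^2 at u = 1 - t; an admissible K_{2,4} on (0, 1 - t)."""
    u_max = 1.0 - t
    return ((1.0 - u_max) ** (-p) - 1.0 - p * u_max) / u_max ** 2


def bound_holds(p, t, grid=1000):
    """Check (1 - u)^(-p) <= 1 + p u + K u^2 on a grid of points u in (0, 1 - t)."""
    k = quadratic_constant(p, t)
    for i in range(1, grid):
        u = (1.0 - t) * i / grid
        if (1.0 - u) ** (-p) > 1.0 + p * u + k * u * u + 1e-12:  # small slack for rounding
            return False
    return True


if __name__ == "__main__":
    alpha, beta = 1.5, 2.5  # example values with beta < alpha / (alpha - 1)
    for t in (0.1, 0.3, 0.5):
        p = (alpha - 1) * beta
        print(t, quadratic_constant(p, t), bound_holds(p, t))
```

The constant blows up as $t$ tends to 0, consistent with the dependence of $K_{2,4}$ on $t$.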
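Hence, since $\frac{n}{Y_1^{(n)}} = \big(1 - \frac{X_1^{(n)}}{n}\big)^{-1}$,

$$\begin{aligned}
\mathbb{E}\Big[\Big(\frac{n}{Y_1^{(n)}}\Big)^{(\alpha-1)\beta}\mathbf{1}_{\{1\le X_1^{(n)}\le n-\lfloor nt\rfloor-1\}}\,\Big|\,(H_0)^c\Big]
&\le \mathbb{E}\Big[1 + (\alpha-1)\beta\,\frac{X_1^{(n)}}{n} + K_{2,4}\Big(\frac{X_1^{(n)}}{n}\Big)^2\,\Big|\,(H_0)^c\Big]\\
&\le 1 + \frac{\beta}{n} + K_{2,5}\,n^{-\alpha},
\end{aligned}$$

for a certain constant $K_{2,5}$. We refer to (11) and (12) for the last inequality.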

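Recall that $\mathbb{P}(H_0) = \frac{\alpha}{n(\alpha-1)} + O(n^{-\alpha})$ and $\beta < \frac{\alpha}{\alpha-1}$. Hence there exist $K_2 > 0$ and
$N_1 > 0$ such that for $n > N_1$,

$$\begin{aligned}
I_{n,2} &\le C_2\sum_{k=2}^{\lfloor nt\rfloor}\mathbb{P}(H_k)\Big(\frac{n}{k}\Big)^{(\alpha-1)\beta} + C_2\,\mathbb{P}((H_0)^c)\,\mathbb{E}\Big[\Big(\frac{n}{Y_1^{(n)}}\Big)^{(\alpha-1)\beta}\mathbf{1}_{\{1\le X_1^{(n)}\le n-\lfloor nt\rfloor-1\}}\,\Big|\,(H_0)^c\Big]\\
&\le C_2\Big(1 - \frac{K_2}{n}\Big).
\end{aligned} \tag{14}$$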
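We now proceed to $I_{n,3}$. Notice that the events $H_k$ are disjoint and that, by Hölder's inequality, $\mathbb{E}[(T_1^{(k)})^{\beta-1}] \le \big(\mathbb{E}[(T_1^{(k)})^{\beta}]\big)^{\frac{\beta-1}{\beta}}$.
Hence we have

$$\begin{aligned}
I_{n,3} &= n^{(\alpha-1)\beta}\,\mathbb{E}\Big[\beta 2^{\beta-1}\frac{e_0}{g_n}\sum_{k=2}^{n-1}\mathbf{1}_{\{H_k\}}(\tilde T_1^{(k)})^{\beta-1}\Big]\\
&= n^{(\alpha-1)\beta}\,\beta 2^{\beta-1}\frac{1}{g_n}\sum_{k=2}^{n-1}\mathbb{P}(H_k)\,\mathbb{E}[(T_1^{(k)})^{\beta-1}]\\
&\le n^{(\alpha-1)\beta}\,\beta 2^{\beta-1}\frac{1}{g_n}\sum_{k=2}^{n-1}\mathbb{P}(H_k)\,(C_2)^{\frac{\beta-1}{\beta}}k^{-(\alpha-1)(\beta-1)}\\
&= (C_2)^{\frac{\beta-1}{\beta}}\,n^{\alpha-1}\beta 2^{\beta-1}\frac{1}{g_n}\sum_{k=2}^{n-1}\mathbb{P}(H_k)\Big(\frac{n}{k}\Big)^{(\alpha-1)(\beta-1)}\\
&\le (C_2)^{\frac{\beta-1}{\beta}}\,n^{\alpha-1}\beta 2^{\beta-1}\frac{1}{g_n}\sum_{k=2}^{n-1}\mathbb{P}(H_k)\frac{n}{k},
\end{aligned}$$

where the last inequality uses $(\alpha-1)(\beta-1) < 1$. Recall (see Remark 5) that $\mathbb{E}\big[\sum_{k=2}^{n-1}\frac{n}{k}\mathbf{1}_{\{H_k\}}\big] = 1 + O(n^{-1})$, and that $g_n \sim \frac{1}{\alpha\Gamma(\alpha)}n^{\alpha}$. Hence
there exists a constant $K_3 > 0$ such that

$$I_{n,3} \le \frac{(C_2)^{\frac{\beta-1}{\beta}}K_3}{n}. \tag{15}$$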

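We now conclude with $I_{n,4}$. Recall that by step 1, there exists $C_1 > 0$ such that for all $n \ge 2$,
$\mathbb{E}[n^{\alpha-1}T_1^{(n)}] < C_1$. Then

$$\begin{aligned}
I_{n,4} &= n^{(\alpha-1)\beta}\,\mathbb{E}\Big[\beta 2^{\beta-1}\Big(\frac{e_0}{g_n}\Big)^{\beta-1}\sum_{k=2}^{n-1}\mathbf{1}_{\{H_k\}}\tilde T_1^{(k)}\Big]\\
&= n^{(\alpha-1)\beta}\,\beta 2^{\beta-1}\,\frac{\mathbb{E}[e_0^{\beta-1}]}{g_n^{\beta-1}}\sum_{k=2}^{n-1}\mathbb{P}(H_k)\,\mathbb{E}[T_1^{(k)}]\\
&\le C_1\beta 2^{\beta-1}\,\mathbb{E}[e_0^{\beta-1}]\Big(\frac{n^{\alpha-1}}{g_n}\Big)^{\beta-1}\sum_{k=2}^{n-1}\mathbb{P}(H_k)\Big(\frac{n}{k}\Big)^{\alpha-1}\\
&\le C_1\beta 2^{\beta-1}\,\mathbb{E}[e_0^{\beta-1}]\Big(\frac{n^{\alpha-1}}{g_n}\Big)^{\beta-1}\sum_{k=2}^{n-1}\mathbb{P}(H_k)\frac{n}{k}.
\end{aligned}$$

Hence, there exists $K_4 > 0$ such that

$$I_{n,4} \le \frac{K_4}{n^{\beta-1}} \le \frac{K_4}{n}. \tag{16}$$

In the last inequality, we use the fact that $\beta \ge 2$.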

Using (8), (9), (14), (15), (16), we have proved that if there exists $C_2 > 0$ such that for all
$2 \le k \le n-1$, $\mathbb{E}[(k^{\alpha-1}T_1^{(k)})^{\beta}] < C_2$, then for any $n > N_1$,

$$\mathbb{E}[(n^{\alpha-1}T_1^{(n)})^{\beta}] \le C_2 + \Big(K_1 - C_2K_2 + (C_2)^{\frac{\beta-1}{\beta}}K_3 + K_4\Big)\frac{1}{n}.$$

If we choose $C_2$ large enough such that $\mathbb{E}[(k^{\alpha-1}T_1^{(k)})^{\beta}] < C_2$ for $2 \le k \le N_1$ and
$K_1 - C_2K_2 + (C_2)^{\frac{\beta-1}{\beta}}K_3 + K_4 < 0$, we have proved by induction that $(\mathbb{E}[(n^{\alpha-1}T_1^{(n)})^{\beta}], n \ge 2)$ is
bounded by $C_2$. The proof of the first part of Proposition [5] is achieved.
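Such a constant $C_2$ always exists because the exponent $\frac{\beta-1}{\beta}$ is smaller than 1, so the linear term $-C_2K_2$ eventually dominates. The following sketch finds one admissible value by doubling; the numerical values of $K_1, \dots, K_4$ and of the lower bound are purely hypothetical, and the function name is an assumption made for illustration.

```python
def choose_c2(k1, k2, k3, k4, beta, lower_bound=1.0):
    """Return some C >= lower_bound with k1 - C*k2 + C**((beta-1)/beta)*k3 + k4 < 0.

    lower_bound stands for the requirement that C_2 dominates the finitely many
    moments with 2 <= k <= N_1 (a hypothetical input here).
    """
    if k2 <= 0 or beta <= 1:
        raise ValueError("need k2 > 0 and beta > 1")
    c = max(1.0, lower_bound)
    # The loop terminates: the negative linear term grows faster than C^((beta-1)/beta).
    while k1 - c * k2 + c ** ((beta - 1) / beta) * k3 + k4 >= 0:
        c *= 2.0
    return c


if __name__ == "__main__":
    # Hypothetical constants, chosen only to exercise the function.
    c2 = choose_c2(k1=1.0, k2=0.1, k3=2.0, k4=3.0, beta=2.5, lower_bound=5.0)
    print(c2, 1.0 - c2 * 0.1 + c2 ** 0.6 * 2.0 + 3.0)
```

Since the left-hand side of the condition is concave in $C_2$ and nonnegative at 0, once it is negative it stays negative for all larger values.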

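Proof of the second part of Proposition [4].

Here, we prove that for fixed $d \in \mathbb{N}$, and $\beta_1, \beta_2, \dots, \beta_d$ non-negative real numbers such that
$\sum_{i=1}^{d}\beta_i < \frac{\alpha}{\alpha-1}$, we have

$$\lim_{n\to\infty}\mathbb{E}\big[(n^{\alpha-1}T_1^{(n)})^{\beta_1}\cdots(n^{\alpha-1}T_d^{(n)})^{\beta_d}\big] = \mathbb{E}\big[(T_1)^{\beta_1}\cdots(T_d)^{\beta_d}\big].$$

For any positive $\beta_i$, $1 \le i \le d$, such that $\sum_{i=1}^{d}\beta_i < \frac{\alpha}{\alpha-1}$, by Hölder's inequality and exchangeability,

$$\begin{aligned}
\mathbb{E}\big[(n^{\alpha-1}T_1^{(n)})^{\beta_1}\cdots(n^{\alpha-1}T_d^{(n)})^{\beta_d}\big]
&\le \Big(\mathbb{E}\big[(n^{\alpha-1}T_1^{(n)})^{\sum_{i=1}^{d}\beta_i}\big]\Big)^{\frac{\beta_1}{\sum_{i=1}^{d}\beta_i}}\cdots\Big(\mathbb{E}\big[(n^{\alpha-1}T_d^{(n)})^{\sum_{i=1}^{d}\beta_i}\big]\Big)^{\frac{\beta_d}{\sum_{i=1}^{d}\beta_i}}\\
&= \mathbb{E}\big[(n^{\alpha-1}T_1^{(n)})^{\sum_{i=1}^{d}\beta_i}\big] \longrightarrow \mathbb{E}\big[T^{\sum_{i=1}^{d}\beta_i}\big],
\end{aligned}$$

when $n$ tends to $\infty$.

So $\mathbb{E}[(n^{\alpha-1}T_1^{(n)})^{\beta_1}\cdots(n^{\alpha-1}T_d^{(n)})^{\beta_d}]$ is bounded for all $n \ge d$. Furthermore, Theorem [3]
gives that $(n^{\alpha-1}T_1^{(n)})^{\beta_1}\cdots(n^{\alpha-1}T_d^{(n)})^{\beta_d}$
converges in distribution to $(T_1)^{\beta_1}\cdots(T_d)^{\beta_d}$.
We conclude with Lemma [2] to get Corollary 3.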

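5. Proof of Theorem [1] and Corollary [2]
First of all, let us give the proof of Theorem [1]. Recall that $L_{ext}^{(n)} = \sum_{i=1}^{n}T_i^{(n)}$. By exchangeability of $(T_1^{(n)}, \dots, T_n^{(n)})$,

$$\begin{aligned}
&\mathbb{E}\big[(n^{\alpha-2}L_{ext}^{(n)} - \alpha(\alpha-1)\Gamma(\alpha))^2\big]\\
&= \mathbb{E}\Big[\Big(\frac{\sum_{i=1}^{n}n^{\alpha-1}T_i^{(n)}}{n} - \alpha(\alpha-1)\Gamma(\alpha)\Big)^2\Big]\\
&= \mathbb{E}\Big[\Big(\frac{\sum_{i=1}^{n}\big(n^{\alpha-1}T_i^{(n)} - \mathbb{E}[n^{\alpha-1}T_1^{(n)}]\big)}{n} + \mathbb{E}[n^{\alpha-1}T_1^{(n)}] - \alpha(\alpha-1)\Gamma(\alpha)\Big)^2\Big]\\
&= \mathbb{E}\Big[\Big(\frac{\sum_{i=1}^{n}\big(n^{\alpha-1}T_i^{(n)} - \mathbb{E}[n^{\alpha-1}T_1^{(n)}]\big)}{n}\Big)^2\Big] + \big(\mathbb{E}[n^{\alpha-1}T_1^{(n)}] - \alpha(\alpha-1)\Gamma(\alpha)\big)^2\\
&= \frac{\mathrm{Var}(n^{\alpha-1}T_1^{(n)})}{n} + \frac{n-1}{n}\,\mathrm{Cov}(n^{\alpha-1}T_1^{(n)}, n^{\alpha-1}T_2^{(n)}) + \big(\mathbb{E}[n^{\alpha-1}T_1^{(n)}] - \alpha(\alpha-1)\Gamma(\alpha)\big)^2.
\end{aligned}$$

The first part of the moment result proved in the previous section implies that $\mathrm{Var}(n^{\alpha-1}T_1^{(n)}) \to \mathrm{Var}(T)$ and $\mathbb{E}[n^{\alpha-1}T_1^{(n)}] \to
\mathbb{E}[T] = \alpha(\alpha-1)\Gamma(\alpha)$, when $n$ tends to $\infty$.

The second part of the same result with $d = 2$ implies that $\mathrm{Cov}(n^{\alpha-1}T_1^{(n)}, n^{\alpha-1}T_2^{(n)})$ converges
to 0 as $n$ tends to $\infty$.

Hence $\mathbb{E}[(n^{\alpha-2}L_{ext}^{(n)} - \alpha(\alpha-1)\Gamma(\alpha))^2]$ converges to 0. This ends the proof of Theorem [1].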
The proof of Corollary [2] is straightforward. Recall that (see [2]) $n^{\alpha-2}L^{(n)}$ converges in
probability to

$$\frac{\alpha\Gamma(\alpha)(\alpha-1)}{2-\alpha}.$$

Hence we get that $L_{ext}^{(n)}/L^{(n)}$ converges in probability to $2 - \alpha$, which is Corollary [2].
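The arithmetic behind the limit $2-\alpha$ is a one-line cancellation, which the following small sketch makes explicit (function names are hypothetical): it divides the limit of $n^{\alpha-2}L_{ext}^{(n)}$ by the limit of $n^{\alpha-2}L^{(n)}$ for a few values of $\alpha \in (1,2)$.

```python
from math import gamma


def external_length_limit(a):
    """Limit of n^(a-2) * L_ext^(n): a (a - 1) Gamma(a)."""
    return a * (a - 1) * gamma(a)


def total_length_limit(a):
    """Limit in probability of n^(a-2) * L^(n): a Gamma(a) (a - 1) / (2 - a)."""
    return a * gamma(a) * (a - 1) / (2 - a)


if __name__ == "__main__":
    for a in (1.1, 1.5, 1.9):
        ratio = external_length_limit(a) / total_length_limit(a)
        print(a, ratio, 2 - a)  # the two last columns agree
```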